Next Article in Journal
Diagnosis of Cardiac Surgery-Associated Acute Kidney Injury: State of the Art and Perspectives
Next Article in Special Issue
Upregulation of miRNA-200c during Disease Progression in COVID-19 Patients
Previous Article in Journal
Social Disparities in Outpatient and Inpatient Management of Pediatric Supracondylar Humerus Fractures
Previous Article in Special Issue
Combined First Month Body Weight Loss and Development of Tolerance as Predictors of 6-Month Efficacy of Mazindol in Mild and Moderate Obese Subjects
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Machine Learning Model for Predicting Hospitalization in Patients with Respiratory Symptoms during the COVID-19 Pandemic

by
Victor Muniz De Freitas
1,
Daniela Mendes Chiloff
1,
Giulia Gabriella Bosso
1,
Janaina Oliveira Pires Teixeira
1,
Isabele Cristina de Godói Hernandes
1,
Maira do Patrocínio Padilha
1,
Giovanna Corrêa Moura
1,
Luis Gustavo Modelli De Andrade
2,
Frederico Mancuso
3,
Francisco Estivallet Finamor
3,
Aluísio Marçal de Barros Serodio
4,
Jaquelina Sonoe Ota Arakaki
5,
Marair Gracio Ferreira Sartori
6,
Paulo Roberto Abrão Ferreira
7 and
Érika Bevilaqua Rangel
8,*
1
Paulista School of Medicine, Hospital São Paulo, Federal University of São Paulo, São Paulo 04038-901, Brazil
2
Department of Internal Medicine, Botucatu Medical School, University of São Paulo State, Botucatu 18618-687, Brazil
3
Discipline of Emergency Medicine, Department of Medicine, Paulista School of Medicine, Hospital São Paulo, Federal University of São Paulo, São Paulo 04038-901, Brazil
4
Sector of Bioethics, Department of Surgery, Paulista School of Medicine, Hospital São Paulo, Federal University of São Paulo, São Paulo 04038-901, Brazil
5
Pneumology Division, Department of Medicine, Paulista School of Medicine, Hospital São Paulo, Federal University of São Paulo, São Paulo 04038-901, Brazil
6
Department of Obstetrics, Paulista School of Medicine, Hospital São Paulo, Federal University of São Paulo, São Paulo 04038-901, Brazil
7
Infectious Disease Division, Department of Medicine, Paulista School of Medicine, Hospital São Paulo, Federal University of São Paulo, São Paulo 04038-901, Brazil
8
Nephrology Division, Department of Medicine, Paulista School of Medicine, Hospital São Paulo, Federal University of São Paulo, São Paulo 04038-901, Brazil
*
Author to whom correspondence should be addressed.
J. Clin. Med. 2022, 11(15), 4574; https://doi.org/10.3390/jcm11154574
Submission received: 7 July 2022 / Revised: 30 July 2022 / Accepted: 1 August 2022 / Published: 5 August 2022
(This article belongs to the Special Issue Predictive Diagnostics and Personalized Treatment)

Abstract

:
A machine learning approach is a useful tool for risk-stratifying patients with respiratory symptoms during the COVID-19 pandemic, as it is still evolving. We aimed to verify the predictive capacity of a gradient boosting decision trees (XGboost) algorithm to select the most important predictors including clinical and demographic parameters in patients who sought medical support due to respiratory signs and symptoms (RAPID RISK COVID-19). A total of 7336 patients were enrolled in the study, including 6596 patients that did not require hospitalization and 740 that required hospitalization. We identified that patients with respiratory signs and symptoms, in particular, lower oxyhemoglobin saturation by pulse oximetry (SpO2) and higher respiratory rate, fever, higher heart rate, and lower levels of blood pressure, associated with age, male sex, and the underlying conditions of diabetes mellitus and hypertension, required hospitalization more often. The predictive model yielded a ROC curve with an area under the curve (AUC) of 0.9181 (95% CI, 0.9001 to 0.9361). In conclusion, our model had a high discriminatory value which enabled the identification of a clinical and demographic profile predictive, preventive, and personalized of COVID-19 severity symptoms.

1. Introduction

The coronavirus disease 2019 (COVID-19) posed considerable health pressure on a global scale. In countries with socialized healthcare systems, the sudden influx of patients with respiratory symptoms quickly outgrew the capacity [1,2].
In the early phase of the COVID-19 pandemic, meta-analysis studies documented that dyspnea, anorexia, dizziness, and fatigue were significantly associated with the critical outcome [3]. Likewise, oxyhemoglobin saturation by pulse oximetry (SpO2), body temperature, and mean arterial pressure on admission can also predict COVID-19 patients with a high probability of mortality [4]. The COVID-19 timeline was described as the median duration from illness onset to dyspnea of 8.0 days (interquartile range, 5.0–13), whereas from onset of symptoms to first hospital admission was 7.0 days (4.0–8.0) [5,6].
Importantly, the COVID-19 burden is related to underlying comorbidities, in particular hypertension, diabetes mellitus (DM), cardiovascular disease, and chronic kidney disease, which are more likely to be presented with increasing age [7,8]. Other clinical determinants of COVID-19 severity comprise male sex, obesity, smoking, chronic obstructive pulmonary disease (COPD), cerebrovascular disease, malignancy, and chronic liver disease [8].
Therefore, the early identification of factors that predict hospitalization in patients affected by COVID-19 contributes to therapeutic decisions, patient flow management, and allocation of resources. To address these issues, machine learning algorithms allow us to assess comorbidities and clinical features most significantly associated with COVID-19 progression and mortality [9].
In line with these findings, a framework based on male sex, age ≥60 years, known contact with an infected person, and the appearance of five initial clinical symptoms, including cough, fever, headache, sore throat, shortness of breath, enabled the prediction of COVID-19 test results with high accuracy [10]. Importantly, COVID-19 symptoms can be monitored through wireless devices, such as sensors connected to the body and the information can be transmitted through the internet using cell phones. This approach has implications for the prioritization of hospital resources and prevents the spread of SARS-CoV-2 [11]. Moreover, healthcare technology, such as ultrasound and computerized tomography, can also be combined with artificial intelligence to benefit the health system response to the identification of disease clusters, monitoring of cases, mortality risk, COVID-19 diagnosis, disease management, and future pandemic waves of COVID-19 and other respiratory viruses in general [11,12].
Thus, our study aimed to define the main symptoms, signs, and demographic data that were most often associated with the risk of hospitalization in patients with respiratory signs and symptoms that were evaluated in a tertiary hospital during the first wave of COVID-19, from March to August 2020, in São Paulo, Brazil.

2. Materials and Methods

2.1. Study Population

We reviewed data from all patients referred to the Paulist School of Medicine at Hospital São Paulo, a tertiary university hospital at the Federal University of São Paulo (EPM-UNIFESP), São Paulo, SP, between March 2020 and August 2020 with respiratory syndrome during the first wave of the COVID-19 pandemic. A total of 7336 patients were enrolled in the study individuals, and 6596 patients were not hospitalized, whereas 740 patients required hospitalization. On that occasion, the Unit of Respiratory Infection was temporarily created as a reference for patients with respiratory signs and symptoms during the COVID-19 pandemic.
Exclusion criteria were patients under 18 years old and those who after medical evaluation did not meet the criteria for respiratory disease.

2.2. Assessment

We evaluated medical history, and clinical parameters including vital signs, such as blood pressure, respiratory rate, heart rate, temperature, and oxyhemoglobin saturation by pulse oximetry (SpO2), demographic data, and pre-existing comorbidities. All data were registered in the electronic health record (EHR). Hospitalization was based on medical decisions, in particular, when SpO2 was lower than 94% and respiratory rate was greater than 24 bpm. After hospitalization, more than 90% of the individuals tested positive for SARS-CoV-2 using RT-PCR from nasopharyngeal samples. For those who were not hospitalized, clinical status was monitored by phone calls to retrieve the status (hospitalization or non-hospitalization).
Data were automatically retrieved from the EHR and manually inputted by the researchers. Discrepancies were solved by one of the investigators (V.M.F.) after reviewing the entirety of the patient’s chart. When discrepancies could not be solved or data were not registered, it was marked as “not available”.
From the total of 7336 patients, missing data ranged from 0% to 29.5% of collected information. The least missing data variables were sex and age (no missing data) and the most missing data variables were respiratory rate and seasonal influenza vaccine in the current year (26.6% and 29.5%, respectively) (Table S1). Regarding comorbidities, data were missing from 11.6% to 11.7% of the total sample, the biggest being active or previous cancer (858 patients) and the least being active smoking (851 patients) (Table S1). Regarding symptoms, data were missing from 11.8% to 11.9% of the total sample, the biggest being sore throat (871 patients) and the least being sneezing, abdominal pain, and chills (866 patients) (Table S1).
The Ethics Committee from the Federal University of São Paulo approved the study (CAAE: 41400720.7.0000.5505). All the methods were performed following guidelines and regulations. In addition, this study was carried out under the Declaration of Helsinki. The requirement for informed consent was waived by the Ethics Committee because our study used anonymized data for analysis.

2.3. Statistical Analysis

2.3.1. Predictive Model

For the predictive model, the categorical variables were transformed into dummy variables. We removed the variables with more than 30% missing values and imputed the others using nearest neighbors (Table S2). Once the nearest neighbors are determined, the model is used to predictor nominal variables, and the mean is used for numeric data. Variables with zero or near-zero variance were removed from the model. To fit the boosted tree models, we used one-hot encoding, and to fit the Lasso regression we normalized the predictors.
To adjust to the class imbalance, the synthetic minority over-sampling (SMOTE) method was used to create synthetic classes in the training set (balancing). The SMOTE algorithm generated new examples of the minority class using the nearest neighbors of these cases.

2.3.2. Model Training

We split the data into derivation (training, n = 5868) and validation (test, n = 1468) data sets. To create the data sets, we used a random split stratified by the target into training (80%) and test (20%). Next, we fitted gradient boosting decision trees (XGBoost), random forest, light GBM, and Lasso regression to all available predictors. The best hyperparameters were selected for each model, using machine learning approaches by 10-fold cross-validation in a train set aimed to maximize the area under the receiver operating characteristic curve (AUC-ROC) and to reduce the bias. Additionally, we fitted a reduced model using the highest scores of the best model (top 11 predictors).

2.3.3. Assessment of Accuracy

The accuracy of the derivation cohort model was tested on the data of the validation cohort. We used the area under the receiver operating characteristic curve (AUC-ROC) and balance accuracy to discriminate the ability of the models in the train and test set. The 95% confidence intervals (CI) of AUC-ROC were estimated by bootstrap resampling (2000 samples) to reduce overfit bias. To evaluate the goodness of fit of models, the predicted versus observed target values were plotted in a confusion matrix of the validation cohort. We select the model with the highest AUC-ROC score and balanced accuracy to fit the reduced model (RAPID RISK COVID-19). The reduced model was fitted using the XGBoost algorithm.
The software R version 4.0.2 (Vienna, Austria) and the packages tidy models.

3. Results

In Figure 1, we documented the number of respiratory cases that required hospitalization (n = 740) and the cases that presented mild respiratory disease (n = 6596) during the first wave of the COVID-19 pandemic at São Paulo Hospital, Federal University of São Paulo, SP, Brazil, from March to August 2020. Therefore, the temporal distribution of evaluated patients shown in Figure 1 reflects the overall tendency registered in Brazil, except for an initial spike at end of March, when restrictive measures were established in São Paulo. During the whole interval evaluated, the number of patients hospitalized remained somewhat constant despite occasional peaks of patients, with an overall hospitalization rate of 10%.
Patients who required hospitalization were more likely to be men with a median age of 58 years (Table 1). The signs which were more often associated with the need for hospitalization comprised lower blood pressure (systolic and diastolic), higher temperature, heart rate, and respiratory rate, whereas oxygen saturation was lower (Table 1). Additionally, we verified that a group of symptoms represented a useful tool for risk-stratifying patients with severe respiratory disease. These symptoms included fatigue, dry cough, breathing difficulty, anorexia, and nausea/vomiting. Conversely, sneezing, running nose, sore throat, headache, and thoracic pain were frequently found in those patients with mild respiratory disease (Table 1). Other symptoms, such as diarrhea, productive cough, anosmia, myalgia, dysgeusia, chills, abdominal pain, and wheezing were equally found in the two groups.
Previous influenza vaccination was observed in those who required hospitalization (Table 1; 47% vs. 39%, p = 0.007), which could be explained by the highest age in this group and, therefore, higher access to influenza vaccination before the pandemic. To note, when our study was conducted, there was no report about the therapeutic potential of dexamethasone and remdesivir on this occasion.
When the underlying comorbidities were analyzed, we found that hypertension, cardio- and cerebrovascular diseases, diabetes mellitus (DM), chronic kidney disease, autoimmune disease, neoplasia, obesity, and solid organ transplant were correlated to the need for hospitalization (Table 2). Unexpectedly, asthma, COPD, and smoking were not associated with hospitalization requirements. To note, hospitalization 15 days before seeking medical assistance was reported more often in those who required hospitalization (Table 2; p < 0.001).
Next, we split the data into derivation (training, n = 5868) and validation (test, n = 1468) data sets (Table 3). The models were fitted using all available predictors that were related to symptoms at onset, vital signs at onset, comorbidities, and demography (n = 45 predictors). In both data sets, hospitalization was required in 10% of the patients (Table 3). The results showed that boosted trees (XGBoost and light GBM), random forest, and Lasso regression had values of AUC-ROC between 0.899 and 0.930 in the test set (Table 4). To fit the reduced model, we choose the top 11 predictors of the best algorithm. We selected the XGBoost as the best model (higher values of AUC-ROC combined with higher balanced accuracy in the test set).
We then fitted a reduced model using the top 11 predictors retrieved from the XGBoost algorithm, which provided the RAPID RISK COVID-19. The reduced predictive model utilizing SpO2, respiratory rate, age, duration of symptoms, sex, temperature at admission, presence of hypertension, heart rate, systolic and diastolic blood pressure at admission, and presence of DM (SHAP, Figure 2A) yielded an AUC-ROC of 0.9181 (95% CI, 0.9001 to 0.9361) (Figure 2B). We showed that the reduced model retrieved similar scores (AUC-ROC, balanced accuracy, precision, and F1-score) compared to the full model (Table 4). To note, SHAP or SHAPley Additive exPlanations is a visualization tool that can be used for making a machine learning model more explainable by visualizing its output. It can be used for explaining the prediction of XGBoost by computing the contribution of each feature to the prediction.
In Figure 3, we summarize our findings.

4. Discussion

Through this study, we were able to demonstrate that patients with respiratory signs, in particular, lower SpO2 and higher respiratory rate, fever, higher heart rate, and lower levels of blood pressure required hospitalization more often during the first wave of COVID-19 pandemic in a reference university hospital is São Paulo, Brazil. Additionally, age, male sex, longer symptom duration, and comorbidities, such as hypertension and DM, also led to an increase in hospitalization requirements.
Although our study comprised a single center, we observed that our data reflect the pattern of COVID-19 evolution during the first wave of the pandemic in Brazil, when the R0 value was estimated at 3.1 [13]. In addition, the state of São Paulo represented the largest state affected by the COVID-19 pandemic, presenting an incidence of 294 hospitalizations per 100,000 inhabitants on that occasion [14]. Most hospitalized patients were reported to be men, elderly, and present underlying comorbidities, including hypertension, chronic lung disease, DM, and cerebrovascular disease [14].
Despite not testing all patients, COVID-19 was the main cause of hospital deaths in 2020 (19.5%), exhibiting an increase of 16.7% in comparison to 2019 in Brazil [15]. Importantly, no influenza pandemic was documented during the current study, so we may assume that we were dealing mainly with cases of COVID-19. As Hospital São Paulo was a reference for respiratory symptoms during the pandemic, our data reflect, therefore, the general population.
Our model (RAPID RISK COVID-19) is a useful tool for risk-stratifying patients with respiratory symptoms, as the COVID-19 pandemic is still evolving. On top of that, a shortage of reagents can difficult the diagnosis of SARS-CoV-2 infection. Therefore, for population health policies, our findings provide evidence to guide those patients who are admitted to a basic health unit or consulted via telemedicine for the need for hospitalization. Notably, peripheral blood oxygenation, heart and respiratory rates, temperature, and blood pressure are inexpensive and easily accessible, as medical professionals can determine health care priorities with real-time assessment. This approach is of paramount importance for identifying high-risk COVID-19 patients, in particular, in low- and middle-income countries.
Lower SpO2, higher frequency respiratory rate, fever, fatigue, and cough are important features for the prediction of the COVID-19 severity and were identified as risk factors for hospitalization at the beginning of the pandemic [16,17,18]. These signs may be explained by the direct SARS-CoV-2 infection of the epithelial lung and bronchial cells, as well as, several other organs [19,20], associated with the dysregulation of the renin–angiotensin–aldosterone pathway, which leads to tissue remodeling, inflammation, vasoconstriction, and vascular permeability; endothelial cell damage and thrombo-inflammation, which promotes an imbalance of fibrinolysis and thrombin production, and dysregulated immune response, characterized by T cell lymphopenia due to exhaustion, inhibition of interferon signaling by SARS-CoV-2, and hyperactive innate immunity with higher levels of neutrophils and monocytes, which is ultimately involved in the cytokine storm [20]. SARS-CoV-2 viremia is implicated in worsening COVID-19 [21] and may lead to a systemic response, as documented by higher fever, heart rate, lower blood pressure levels, and fatigue. However, other symptoms, such as a running nose, sore throat, sneezing, headache, and olfactory abnormalities indicate mild cases of COVID-19 [22].
Importantly, machine learning algorithms may also predict the early onset of acute distress respiratory syndrome in critically ill adults with COVID-19 when oxygen saturation, respiratory rate, and blood pressure were evaluated accordingly to age and sex [23]. XGBoost models may also predict COVID-19 progression and mortality when trained with data from the last 24 h. Therefore, respiratory rate, SpO2, age greater than 75 years, and laboratory parameters (lactate dehydrogenase, calcium, glucose, and C-reactive protein) were important for risk-stratifying patients with COVID-19 during the first wave, from January to August 2000 [24], which represent the same period when our study was conducted.
To note, using machine learning approaches we fitted machine learning models that achieve higher AUC-ROC values (greater than 0.90) using 45 predictors. The performance of different models was very similar, especially in boosted trees. The models keep a similar performance in the test set that was held separate from the train data indicating a low risk of overfitting. However, in clinical practice, we fitted a reduced model using the top 11 predictors to facilitate the applicability of the algorithm. This model was named RAPID RISK COVID-19 and had performance metrics similar to the complete model in the test set.
In addition, the odds ratio for clinical decompensation within 24 h reaches 5.17 when a SpO2 < 93% is associated with leucocytosis and a low glomerular filtration rate is present in patients with heart failure [25]. Lower SpO2 values ≤ 90%, measured in an out-of-hospital setting, were associated with a greater than 50% decrease in the probability of being discharged alive, regardless of age [26], supporting, therefore, the decision for prompt hospital admission. The lower levels of PaO2/FIO2, when associated with lymphopenia, higher levels of C-reactive protein and lactate dehydrogenase, and higher chest tomography scores may also predict prolonged hospitalization [27]. Age, cardiovascular disease, CKD, dyspnoea, tachypnea, confusion, systolic blood pressure, and SpO2 ≤ 93% or oxygen requirement discriminate the composite outcome of in-hospital mortality, mechanical ventilation, or admission to the intensive care unit [28]. In a Brazilian and Spanish cohort, seven variables (age, number of comorbidities, heart rate, blood urea nitrogen, C-reactive protein, platelet count, and SpO2/FIO2) enabled early identification of risk factors predicting in-hospital mortality [29]. SpO2 can also be incorporated into the ROX index ([SpO2/FIO2]/frequency respiratory) to estimate the failure of high-flow cannula failure in COVID-19 patients with acute hypoxemic respiratory failure [30]. Clinicians can take advantage of the results obtained from these models, as an online risk calculator is already available [31].
Age is a risk factor for COVID-19 progression and mortality [7,8,14,22,32,33,34], as aging is frequently associated with the presence of a high number of underlying comorbidities [7] and changes in the immune system including a profile of exaggerated inflammatory response of the innate immune system and immunosenescence of adaptive immune system, which ultimately increases the severity of COVID-19 [35]. Age-related laboratory profile is characterized by higher levels of pro-inflammatory and tissue damage markers, in particular, in elderly men [36]. The lower response to vaccination associated with lower titles of neutralizing antibodies against SARS-CoV-2 [37] poses elderly individuals with a worse outcome.
Male sex is also associated with a higher risk for COVID-19 progression and mortality [14,32,33,38,39]. Sex disparity in COVID-19 severity may be explained by molecular and cellular features. Therefore, females have lower levels of pro-inflammatory and tissue damage markers when compared to males [36,39]. Differences in molecular mechanisms provide a survival advantage in females with COVID-19, as ACE2 and Toll-like receptors (TLR) signaling genes are located in the X chromosome, providing a more versatile and stronger immune response, whereas estrogen contributes to T-cell activation [40]. Therefore, males have a distinct interaction not only between immune cells but also between epithelium and immune cells, which explain the severe outcomes [39].
The presence of underlying comorbidities aggravates the COVID-19 setting. SARS-CoV-2 has a susceptibility to tissue invasion based on the expression of angiotensin-converting enzyme-2 (ACE2) receptors in different organs [19]. Therefore, chronic diseases favor virus entrance and increase the COVID-19 burden [22]. Endothelial damage due to direct virus toxicity and inflammatory response are hallmarks of COVID-19 severity [20,41]. Thus, the presence of cardiometabolic disease, which is another pandemic worldwide, identifies patients with a greater risk of COVID-19 severity. Therefore, hypertension is an independent risk factor for COVID-19 progression and mortality [14,16,18,38]. Blood pressure control is associated with a decrease in the COVID-19 burden and should be encouraged during the pandemic, in patients with advanced atherosclerosis and target organ damage [42].
Likewise, DM is associated with higher rates of mortality due to COVID-19 [14,16,18,32,38]. DM is often associated with hypertension, and cardio- and cerebrovascular diseases, which increase the risk for hospitalization [43]. Recent DM-related pathophysiologic mechanisms in COVID-19 unveiled direct toxicity of islet cells, increase in insulin resistance, and immune system dysfunction associated with microthrombi and endotheliitis, which aggravates glucose control in those patients with pre-existing DM and those with newly diagnosed DM [44]. Adequate blood glucose control ameliorates the COVID-19 burden [45] and should be equally pursued during the pandemic.
Cardiac disease is associated with higher expression of ACE2 and tissue susceptibility to SARS-CoV-2 infection [19], which puts patients with cardiac disease at great risk of hospitalization and mortality [18,25,32,38]. The heart is affected by the inflammatory response, in particular, the cytokine storm and macrophage activation, which promotes endotheliitis, microvascular dysfunction, and thrombosis, in addition to direct damage, an imbalance of ACE2 and angiotensin II, and an increase in myocardial oxygen consumption due to fever, hypoxemia and augmented adrenergic drive [46]. Similarly, cerebrovascular disease can also aggravate COVID-19 [14,34], as SARS-CoV-2 may directly invade neuron cells and also promote endotheliopathy and inflammation [19].
Obesity may also pose COVID-19 patients at high risk for progression [16,32,43], including young patients [47]. Adipose tissue is a reservoir for SARS-CoV-2 replication, as ACE2 is abundantly expressed, which ultimately aggravates the inflammatory milieu [19]. Obese individuals, besides the presence of previous underlying comorbidities, may have a worse pulmonary function performance, presenting lower levels of PaO2 and SatO2 at admission and the requirement for higher volumes of oxygen [48]. Obese patients express ACE2 more often in the epithelial cells of the lungs [49], which contributes to a worse prognosis.
In line with these findings, chronic kidney disease, which is mainly associated with hypertension, DM, cardio- and cerebrovascular diseases, and obesity, is also correlated to a worse COVID-19 outcome [25,32,43]. SARS-CoV-2 may affect kidney function by several mechanisms, including direct virus toxicity, cytokine storm, rhabdomyolysis, and cardiac decompensation [50].
Solid organ transplant recipients [51] and autoimmune diseases [52] are chronic conditions associated with more severe lymphopenia, a hallmark of COVID-19, and more severe disease, increased viral shedding, lower rates of seroconversion, displaying higher rates of case fatality. Although hematologic solid cancer patients exhibit prolonged viral shedding, delayed or no seroconversion, and sustained immune dysregulation following viral clearance (heterogeneous humoral response and an exhausted T cell phenotype), solid cancer patients present some prolonged viral shedding, early sustained seroconversion, and near-complete resolution of immune dysregulation following viral clearance [53]. Importantly, the need for hospitalization and mortality rates remain higher during all waves of the pandemic, which is a critical condition worldwide and prompts an immediate vaccination regimen for these individuals.
Unexpectedly, smoking and COPD were not correlated to the need for hospitalization, as opposed to the literature [14,18,32]. Both conditions are associated with higher expression of ACE2 in epithelial bronchial and pulmonary cells [19] and systemic inflammatory dysregulation [54]. However, chronic respiratory diseases, yet associated with a higher risk of COVID-19 progression and mortality, were not associated with an augmented risk of hospitalization [55]. The discrepancy in our data may be explained at least in part by the number of patients included in the study and the missing data, which contributed to the under-representation of these patients, or better adherence to precautionary measures.
The impact of asthma on COVID-19 progression is a subject of debate in the literature. Some reports indicate that asthma is an independent risk factor for COVID-19 hospitalization, but not for COVID-19 infection [56]. Our findings showed that pre-existing asthma was not a risk factor for hospitalization, as documented by a meta-analysis [57]. We speculate that better adherence to asthmatic treatments during the pandemic [58], the fact that the gene expression of ACE2, TMPRSS2, and furin is not upregulated in asthmatic patients [59], or that the differences in asthma phenotype that were not deeply analyzed across the studies [60], and the inflammatory milieu in asthmatic patients (type 2 cytokines, including IL-4 and IL-13 and accumulation of eosinophils) or the conventional therapeutics for asthma, including inhaled corticosteroids [61] may have conferred a protective effect for asthmatic patients with COVID-19.
The main limitations of the study were the retrospective analysis, the amount of missing data, and the lack of testing in those patients who were not hospitalized. Additionally, we did not provide external validation in an independent cohort.
In conclusion, while recent advances have been made in the areas of diagnostics, therapeutics, and vaccination for COVID-19, there remains a need for information management tools and a common platform for cross-border collaboration on future pandemic preparedness and response. The lessons learned during the current COVID-19 pandemic have equipped us with a toolbox to tackle future pandemics. Thus, the use of health surveillance technologies [62] associated with the clinical, laboratory, and imaging parameters [12,63,64] will pave the way for the development of predictive, preventive, and personalized solutions.
Therefore, our model had a high discriminatory value that enabled the identification of a clinical and demographic profile predictive of disease severity. Moreover, our approach assisted clinical decision triage and provided additional biological insights into disease progression. However, further research is required to determine whether this tool can also be applied to outpatient or home-based COVID-19 patients, as well as to novel SARS-CoV-2 variants and in the post-vaccination setting.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jcm11154574/s1, Table S1. Comparison of demography, signs, symptoms, and comorbidities between patients who required hospitalization and those who did not require hospitalization from March to August-2000 at São Paulo Hospital, Brazil. Table S2. Predictors and missing values numbers used in predicting hospitalization in patients with respiratory symptoms during the COVID-19 pandemic.

Author Contributions

Conceptualization: V.M.D.F., D.M.C., G.G.B., J.O.P.T., I.C.d.G.H., M.d.P.P., G.C.M., F.M., F.E.F., A.M.d.B.S., J.S.O.A., M.G.F.S., P.R.A.F. and É.B.R. Data curation: V.M.D.F., D.M.C., G.G.B., J.O.P.T., I.C.d.G.H., M.d.P.P. and G.C.M. Formal analysis: V.M.D.F., L.G.M.D.A. and É.B.R. Investigation: V.M.D.F., D.M.C., G.G.B., J.O.P.T., I.C.d.G.H., M.d.P.P., G.C.M., F.M., F.E.F., A.M.d.B.S., J.S.O.A., M.G.F.S., P.R.A.F. and É.B.R. Methodology: V.M.D.F., L.G.M.D.A. and É.B.R. Supervision: É.B.R. Validation: V.M.D.F. and É.B.R. Writing—original draft: V.M.D.F. and É.B.R. Writing—review and editing: V.M.D.F. and É.B.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was carried out following the Declaration of Helsinki and approved by the Institutional Review Board of the Federal University of São Paulo (CAAE: 41400720.7.0000.5505); approved on 6 February 2021.

Informed Consent Statement

Patient consent was waived by the Institutional Review Board due to the retrospective nature of the study, and all data collected were de-identified.

Data Availability Statement

The database is available from the corresponding author upon reasonable request.

Acknowledgments

We would like to thank the entire medical and nursing staff from Hospital São Paulo who was committed to evaluating the patients at the Unit of Respiratory Infection during the COVID-19 pandemic, as well as all medical students from Paulista School of Medicine at the Federal University of São Paulo, SP, Brazil, who were involved in the follow-up of the patients.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Verelst, F.; Kuylen, E.; Beutels, P. Indications for healthcare surge capacity in European countries facing an exponential increase in coronavirus disease (COVID-19) cases, March 2020. Eurosurveillance 2020, 25, 2000323. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Candido, D.S.; Claro, I.M.; de Jesus, J.G.; Souza, W.M.; Moreira, F.R.R.; Dellicour, S.; Mellan, T.A.; du Plessis, L.; Pereira, R.H.M.; Sales, F.C.S.; et al. Evolution and epidemic spread of SARS-CoV-2 in Brazil. Science 2020, 369, 1255–1260. [Google Scholar] [CrossRef]
  3. Kim, H.J.; Hwang, H.; Hong, H.; Yim, J.J.; Lee, J. A systematic review and meta-analysis of regional risk factors for critical outcomes of COVID-19 during early phase of the pandemic. Sci. Rep. 2021, 11, 9784. [Google Scholar] [CrossRef]
  4. Wang, H.; Ai, H.; Fu, Y.; Li, Q.; Cui, R.; Ma, X.; Ma, Y.F.; Wang, Z.; Liu, T.; Long, Y.; et al. Development of an Early Warning Model for Predicting the Death Risk of Coronavirus Disease 2019 Based on Data Immediately Available on Admission. Front. Med. 2021, 8, 699243. [Google Scholar] [CrossRef] [PubMed]
  5. Huang, C.; Wang, Y.; Li, X.; Ren, L.; Zhao, J.; Hu, Y.; Zhang, L.; Fan, G.; Xu, J.; Gu, X.; et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 2020, 395, 497–506. [Google Scholar] [CrossRef] [Green Version]
  6. Wang, D.; Hu, B.; Hu, C.; Zhu, F.; Liu, X.; Zhang, J.; Wang, B.; Xiang, H.; Cheng, Z.; Xiong, Y.; et al. Clinical Characteristics of 138 Hospitalized Patients with 2019 Novel Coronavirus-Infected Pneumonia in Wuhan, China. JAMA 2020, 323, 1061–1069. [Google Scholar] [CrossRef] [PubMed]
  7. Antos, A.; Kwong, M.L.; Balmorez, T.; Villanueva, A.; Murakami, S. Unusually High Risks of COVID-19 Mortality with Age-Related Comorbidities: An Adjusted Meta-Analysis Method to Improve the Risk Assessment of Mortality Using the Comorbid Mortality Data. Infect. Dis. Rep. 2021, 13, 700–711. [Google Scholar] [CrossRef]
  8. Li, X.; Zhong, X.; Wang, Y.; Zeng, X.; Luo, T.; Liu, Q. Clinical determinants of the severity of COVID-19: A systematic review and meta-analysis. PLoS ONE 2021, 16, e0250602. [Google Scholar] [CrossRef] [PubMed]
  9. Aktar, S.; Talukder, A.; Ahamad, M.M.; Kamal, A.H.M.; Khan, J.R.; Protikuzzaman, M.; Hossain, N.; Azad, A.K.M.; Quinn, J.M.W.; Summers, M.A.; et al. Machine Learning Approaches to Identify Patient Comorbidities and Symptoms That Increased Risk of Mortality in COVID-19. Diagnostics 2021, 11, 1383. [Google Scholar] [CrossRef]
  10. Zoabi, Y.; Deri-Rozov, S.; Shomron, N. Machine learning-based prediction of COVID-19 diagnosis based on symptoms. NPJ. Digit. Med. 2021, 4, 3. [Google Scholar] [CrossRef]
  11. Saeed, U.; Shah, S.Y.; Ahmad, J.; Imran, M.A.; Abbasi, Q.H.; Shah, S.A. Machine learning empowered COVID-19 patient monitoring using non-contact sensing: An extensive review. J. Pharm. Anal. 2022, 12, 193–204. [Google Scholar] [CrossRef] [PubMed]
  12. Fusco, R.; Grassi, R.; Granata, V.; Setola, S.V.; Grassi, F.; Cozzi, D.; Pecori, B.; Izzo, F.; Petrillo, A. Artificial Intelligence and COVID-19 Using Chest CT Scan and Chest X-ray Images: Machine Learning and Deep Learning Approaches for Diagnosis and Treatment. J. Pers. Med. 2021, 11, 993. [Google Scholar] [CrossRef] [PubMed]
  13. de Souza, W.M.; Buss, L.F.; Candido, D.D.S.; Carrera, J.P.; Li, S.; Zarebski, A.E.; Pereira, R.H.M.; Prete, C.A., Jr.; de Souza-Santos, A.A.; Parag, K.V.; et al. Epidemiological and clinical characteristics of the COVID-19 epidemic in Brazil. Nat. Hum. Behav. 2020, 4, 856–865. [Google Scholar] [CrossRef] [PubMed]
  14. Silva, P.V.D.; Oliveira, S.B.; Escalante, J.J.C.; Almiron, M.; Tsuha, D.H.; Sato, H.K.; Menezes, P.R.; Paula, R.C.; D’Agostini, T.L.; Croda, J. Risk Factors for Death Among 120,804 Hospitalized Patients with Confirmed COVID-19 in Sao Paulo, Brazil. Am. J. Trop. Med. Hyg. 2021, 105, 88–92. [Google Scholar] [CrossRef] [PubMed]
  15. Zimmermann, I.R.; Sanchez, M.N.; Alves, L.C.; Frio, G.S.; Cavalcante, F.V.; Cortez-Escalante, J.J.; da Silva, E.N.; Santos, L.M.P. COVID-19 as the leading cause of hospital deaths in the Brazilian public health system in 2020. Int. J. Infect. Dis. 2021, 113, 162–165. [Google Scholar] [CrossRef]
  16. Richardson, S.; Hirsch, J.S.; Narasimhan, M.; Crawford, J.M.; McGinn, T.; Davidson, K.W.; Barnaby, D.P.; Becker, L.B.; Chelico, J.D.; Cohen, S.L.; et al. Presenting Characteristics, Comorbidities, and Outcomes Among 5700 Patients Hospitalized with COVID-19 in the New York City Area. JAMA 2020, 323, 2052–2059. [Google Scholar] [CrossRef] [PubMed]
  17. Grant, M.C.; Geoghegan, L.; Arbyn, M.; Mohammed, Z.; McGuinness, L.; Clarke, E.L.; Wade, R.G. The prevalence of symptoms in 24,410 adults infected by the novel coronavirus (SARS-CoV-2; COVID-19): A systematic review and meta-analysis of 148 studies from 9 countries. PLoS ONE 2020, 15, e0234765. [Google Scholar] [CrossRef]
  18. Yang, J.; Zheng, Y.; Gou, X.; Pu, K.; Chen, Z.; Guo, Q.; Ji, R.; Wang, H.; Wang, Y.; Zhou, Y. Prevalence of comorbidities and its effects in patients infected with SARS-CoV-2: A systematic review and meta-analysis. Int. J. Infect. Dis. 2020, 94, 91–95. [Google Scholar] [CrossRef]
  19. Beyerstedt, S.; Casaro, E.B.; Rangel, E.B. COVID-19: Angiotensin-converting enzyme 2 (ACE2) expression and tissue susceptibility to SARS-CoV-2 infection. Eur. J. Clin. Microbiol. Infect. Dis. 2021, 40, 905–919. [Google Scholar] [CrossRef]
  20. Gupta, A.; Madhavan, M.V.; Sehgal, K.; Nair, N.; Mahajan, S.; Sehrawat, T.S.; Bikdeli, B.; Ahluwalia, N.; Ausiello, J.C.; Wan, E.Y.; et al. Extrapulmonary manifestations of COVID-19. Nat. Med. 2020, 26, 1017–1032. [Google Scholar] [CrossRef] [PubMed]
  21. Li, Y.; Schneider, A.M.; Mehta, A.; Sade-Feldman, M.; Kays, K.R.; Gentili, M.; Charland, N.C.; Gonye, A.L.; Gushterova, I.; Khanna, H.K.; et al. SARS-CoV-2 viremia is associated with distinct proteomic pathways and predicts COVID-19 outcomes. J. Clin. Investig. 2021, 131, e148635. [Google Scholar] [CrossRef]
  22. Gandhi, R.T.; Lynch, J.B.; Del, R.C. Mild or Moderate COVID-19. N. Engl. J. Med. 2020, 383, 1757–1766. [Google Scholar] [CrossRef]
  23. Singhal, L.; Garg, Y.; Yang, P.; Tabaie, A.; Wong, A.I.; Mohammed, A.; Chinthala, L.; Kadaria, D.; Sodhi, A.; Holder, A.L.; et al. eARDS: A multi-center validation of an interpretable machine learning algorithm of early onset Acute Respiratory Distress Syndrome (ARDS) among critically ill adults with COVID-19. PLoS ONE 2021, 16, e0257056. [Google Scholar] [CrossRef]
  24. Wang, J.M.; Liu, W.; Chen, X.; McRae, M.P.; McDevitt, J.T.; Fenyo, D. Predictive Modeling of Morbidity and Mortality in Patients Hospitalized with COVID-19 and its Clinical Implications: Algorithm Development and Interpretation. J. Med. Internet. Res. 2021, 23, e29514. [Google Scholar] [CrossRef] [PubMed]
  25. Wang, A.Z.; Ehrman, R.; Bucca, A.; Croft, A.; Glober, N.; Holt, D.; Lardaro, T.; Musey, P.; Peterson, K.; Schaffer, J.; et al. Can we predict which COVID-19 patients will need transfer to intensive care within 24 hours of floor admission? Acad. Emerg. Med. 2021, 28, 511–518. [Google Scholar] [CrossRef] [PubMed]
  26. Lancet, E.A.; Gonzalez, D.; Alexandrou, N.A.; Zabar, B.; Lai, P.H.; Hall, C.B.; Braun, J.; Zeig-Owens, R.; Isaacs, D.; Ben-Eli, D.; et al. Prehospital hypoxemia, measured by pulse oximetry, predicts hospital outcomes during the New York City COVID-19 pandemic. J. Am. Coll. Emerg. Phys. Open 2021, 2, e12407. [Google Scholar] [CrossRef] [PubMed]
  27. Zinellu, A.; De, V.A.; Scano, V.; Paliogiannis, P.; Fiore, V.; Madeddu, G.; Maida, I.; Zinellu, E.; Mangoni, A.A.; Arru, L.B.; et al. The PaO2/FiO2 ratio on admission is independently associated with prolonged hospitalization in COVID-19 patients. J. Infect. Dev. Ctries. 2021, 15, 353–359. [Google Scholar] [CrossRef]
  28. Martinez-Lacalzada, M.; Viteri-Noel, A.; Manzano, L.; Fabregate, M.; Rubio-Rivas, M.; Luis, G.S.; Arnalich-Fernandez, F.; Beato-Perez, J.L.; Vargas-Nunez, J.A.; Calvo-Manuel, E.; et al. Predicting critical illness on initial diagnosis of COVID-19 based on easily obtained clinical variables: Development and validation of the PRIORITY model. Clin. Microbiol. Infect. 2021, 27, 1838–1844. [Google Scholar] [CrossRef]
  29. Marcolino, M.S.; Pires, M.C.; Ramos, L.E.F.; Silva, R.T.; Oliveira, L.M.; Carvalho, R.L.R.; Mourato, R.L.S.; Sanchez-Montalva, A.; Raventos, B.; Anschau, F.; et al. ABC2-SPH risk score for in-hospital mortality in COVID-19 patients: Development, external validation and comparison with other available scores. Int. J. Infect. Dis. 2021, 110, 281–308. [Google Scholar] [CrossRef]
  30. Prakash, J.; Bhattacharya, P.K.; Yadav, A.K.; Kumar, A.; Tudu, L.C.; Prasad, K. ROX index as a good predictor of high flow nasal cannula failure in COVID-19 patients with acute hypoxemic respiratory failure: A systematic review and meta-analysis. J. Crit. Care 2021, 66, 102–108. [Google Scholar] [CrossRef]
  31. Marcolino, M.S.; Ziegelmann, P.K.; Souza-Silva, M.V.R.; Nascimento, I.J.B.; Oliveira, L.M.; Monteiro, L.S.; Sales, T.L.S.; Ruschel, K.B.; Martins, K.P.M.P.; Etges, A.P.B.S.; et al. Clinical characteristics and outcomes of patients hospitalized with COVID-19 in Brazil: Results from the Brazilian COVID-19 registry. Int. J. Infect. Dis. 2021, 107, 300–310. [Google Scholar] [CrossRef] [PubMed]
  32. Docherty, A.B.; Harrison, E.M.; Green, C.A.; Hardwick, H.E.; Pius, R.; Norman, L.; Holden, K.A.; Read, J.M.; Dondelinger, F.; Carson, G.; et al. Features of 20 133 UK patients in hospital with COVID-19 using the ISARIC WHO Clinical Characterisation Protocol: Prospective observational cohort study. BMJ 2020, 369, m1985. [Google Scholar] [CrossRef] [PubMed]
  33. Castro, M.C.; Gurzenda, S.; Macario, E.M.; Franca, G.V.A. Characteristics, outcomes and risk factors for mortality of 522 167 patients hospitalised with COVID-19 in Brazil: A retrospective cohort study. BMJ Open 2021, 11, e049089. [Google Scholar] [CrossRef]
  34. Katzenschlager, S.; Zimmer, A.J.; Gottschalk, C.; Grafeneder, J.; Schmitz, S.; Kraker, S.; Ganslmeier, M.; Muth, A.; Seitel, A.; Maier-Hein, L.; et al. Can we predict the severe course of COVID-19—A systematic review and meta-analysis of indicators of clinical outcome? PLoS ONE 2021, 16, e0255154. [Google Scholar] [CrossRef] [PubMed]
  35. Bajaj, V.; Gadi, N.; Spihlman, A.P.; Wu, S.C.; Choi, C.H.; Moulton, V.R. Aging, Immunity, and COVID-19: How Age Influences the Host Immune Response to Coronavirus Infections? Front. Physiol. 2020, 11, 571416. [Google Scholar] [CrossRef] [PubMed]
  36. Ten-Caten, F.; Gonzalez-Dias, P.; Castro, I.; Ogava, R.L.T.; Giddaluru, J.; Silva, J.C.S.; Martins, F.; Goncalves, A.N.A.; Costa-Martins, A.G.; Araujo, J.D.; et al. In-depth analysis of laboratory parameters reveals the interplay between sex, age, and systemic inflammation in individuals with COVID-19. Int. J. Infect. Dis. 2021, 105, 579–587. [Google Scholar] [CrossRef]
  37. Wang, J.; Tong, Y.; Li, D.; Li, J.; Li, Y. The Impact of Age Difference on the Efficacy and Safety of COVID-19 Vaccines: A Systematic Review and Meta-Analysis. Front. Immunol. 2021, 12, 758294. [Google Scholar] [CrossRef]
  38. Espinosa, O.A.; Zanetti, A.D.S.; Antunes, E.F.; Longhi, F.G.; Matos, T.A.; Battaglini, P.F. Prevalence of comorbidities in patients and mortality cases affected by SARS-CoV2: A systematic review and meta-analysis. Rev. Inst. Med. Trop. Sao Paulo 2020, 62, e43. [Google Scholar] [CrossRef]
  39. Hou, Y.; Zhou, Y.; Gack, M.U.; Lathia, J.D.; Kallianpur, A.; Mehra, R.; Chan, T.A.; Jung, J.U.; Jehi, L.; Eng, C.; et al. Multimodal single-cell omics analysis identifies epithelium-immune cell interactions and immune vulnerability associated with sex differences in COVID-19. Signal Transduct. Target. Ther. 2021, 6, 292. [Google Scholar] [CrossRef] [PubMed]
  40. Li, Y.; Jerkic, M.; Slutsky, A.S.; Zhang, H. Molecular mechanisms of sex bias differences in COVID-19 mortality. Crit. Care 2020, 24, 405. [Google Scholar] [CrossRef]
  41. Andrianto, A.; Al-Farabi, M.J.; Nugraha, R.A.; Marsudi, B.A.; Azmi, Y. Biomarkers of endothelial dysfunction and outcomes in coronavirus disease 2019 (COVID-19) patients: A systematic review and meta-analysis. Microvasc. Res. 2021, 138, 104224. [Google Scholar] [CrossRef] [PubMed]
  42. Sheppard, J.P.; Nicholson, B.D.; Lee, J.; McGagh, D.; Sherlock, J.; Koshiaris, C.; Oke, J.; Jones, N.R.; Hinton, W.; Armitage, L.; et al. Association Between Blood Pressure Control and Coronavirus Disease 2019 Outcomes in 45 418 Symptomatic Patients with Hypertension: An Observational Cohort Study. Hypertension 2021, 77, 846–855. [Google Scholar] [CrossRef] [PubMed]
  43. Kompaniyets, L.; Pennington, A.F.; Goodman, A.B.; Rosenblum, H.G.; Belay, B.; Ko, J.Y.; Chevinsky, J.R.; Schieber, L.Z.; Summers, A.D.; Lavery, A.M.; et al. Underlying Medical Conditions and Severe Illness Among 540,667 Adults Hospitalized with COVID-19, March 2020-March 2021. Prev. Chronic. Dis. 2021, 18, e66. [Google Scholar] [CrossRef] [PubMed]
  44. Mine, K.; Nagafuchi, S.; Mori, H.; Takahashi, H.; Anzai, K. SARS-CoV-2 Infection and Pancreatic beta Cell Failure. Biology 2021, 11, 22. [Google Scholar] [CrossRef] [PubMed]
  45. Zhu, L.; She, Z.G.; Cheng, X.; Qin, J.J.; Zhang, X.J.; Cai, J.; Lei, F.; Wang, H.; Xie, J.; Wang, W.; et al. Association of Blood Glucose Control and Outcomes in Patients with COVID-19 and Pre-existing Type 2 Diabetes. Cell. Metab. 2020, 31, 1068–1077. [Google Scholar] [CrossRef] [PubMed]
  46. Italia, L.; Tomasoni, D.; Bisegna, S.; Pancaldi, E.; Stretti, L.; Adamo, M.; Metra, M. COVID-19 and Heart Failure: From Epidemiology During the Pandemic to Myocardial Injury, Myocarditis, and Heart Failure Sequelae. Front. Cardiovasc. Med. 2021, 8, 713560. [Google Scholar] [CrossRef]
  47. Zhang, F.; Xiong, Y.; Wei, Y.; Hu, Y.; Wang, F.; Li, G.; Liu, K.; Du, R.; Wang, C.Y.; Zhu, W. Obesity predisposes to the risk of higher mortality in young COVID-19 patients. J. Med. Virol. 2020, 92, 2536–2542. [Google Scholar] [CrossRef]
  48. Moriconi, D.; Masi, S.; Rebelos, E.; Virdis, A.; Manca, M.L.; De, M.S.; Taddei, S.; Nannipieri, M. Obesity prolongs the hospital stay in patients affected by COVID-19, and may impact on SARS-CoV-2 shedding. Obes. Res. Clin. Pract. 2020, 14, 205–209. [Google Scholar] [CrossRef]
  49. Higham, A.; Singh, D. Increased ACE2 Expression in Bronchial Epithelium of COPD Patients who are Overweight. Obesity 2020, 28, 1586–1589. [Google Scholar] [CrossRef]
  50. Bruchfeld, A. The COVID-19 pandemic: Consequences for nephrology. Nat. Rev. Nephrol. 2021, 17, 81–82. [Google Scholar] [CrossRef]
  51. Phadke, V.K.; Scanlon, N.; Jordan, S.C.; Rouphael, N.G. Immune Responses to SARS-CoV-2 in Solid Organ Transplant Recipients. Curr. Transplant. Rep. 2021, 8, 127–139. [Google Scholar] [CrossRef]
  52. Xiao, F.; Han, M.; Zhu, X.; Tang, Y.; Huang, E.; Zou, H.; Jiang, Q.; Lu, L. The immune dysregulations in COVID-19: Implications for the management of rheumatic diseases. Mod. Rheumatol. 2021, 31, 927–932. [Google Scholar] [CrossRef] [PubMed]
  53. Abdul-Jawad, S.; Bau, L.; Alaguthurai, T.; Del Molino, D.B.I.; Laing, A.G.; Hayday, T.S.; Monin, L.; Munoz-Ruiz, M.; McDonald, L.; Francos, Q.I.; et al. Acute Immune Signatures and Their Legacies in Severe Acute Respiratory Syndrome Coronavirus-2 Infected Cancer Patients. Cancer Cell 2021, 39, 257–275. [Google Scholar] [CrossRef] [PubMed]
  54. Acevedo, N.; Escamilla-Gil, J.M.; Espinoza, H.; Regino, R.; Ramirez, J.; de Arco, L.F.; Dennis, R.; Torres-Duque, C.A.; Caraballo, L. Chronic Obstructive Pulmonary Disease Patients Have Increased Levels of Plasma Inflammatory Mediators Reported Upregulated in Severe COVID-19. Front. Immunol. 2021, 12, 678661. [Google Scholar] [CrossRef]
  55. Beltramo, G.; Cottenet, J.; Mariet, A.S.; Georges, M.; Piroth, L.; Tubert-Bitter, P.; Bonniaud, P.; Quantin, C. Chronic respiratory diseases are predictors of severe outcome in COVID-19 hospitalised patients: A nationwide study. Eur. Respir. J. 2021, 58, 2004474. [Google Scholar] [CrossRef] [PubMed]
  56. Ren, J.; Pang, W.; Luo, Y.; Cheng, D.; Qiu, K.; Rao, Y.; Zheng, Y.; Dong, Y.; Peng, J.; Hu, Y.; et al. Impact of Allergic Rhinitis and Asthma on COVID-19 Infection, Hospitalization, and Mortality. J. Allergy Clin. Immunol. Pract. 2022, 10, 124–133. [Google Scholar] [CrossRef] [PubMed]
  57. Han, X.; Xu, J.; Hou, H.; Yang, H.; Wang, Y. Impact of asthma on COVID-19 mortality in the United States: Evidence based on a meta-analysis. Int. Immunopharmacol. 2022, 102, 108390. [Google Scholar] [CrossRef] [PubMed]
  58. Dhruve, H.; d’Ancona, G.; Holmes, S.; Dhariwal, J.; Nanzer, A.M.; Jackson, D.J. Prescribing Patterns and Treatment Adherence in Patients with Asthma During the COVID-19 Pandemic. J. Allergy Clin. Immunol. Pract. 2022, 10, 100–107. [Google Scholar] [CrossRef]
  59. Bradding, P.; Richardson, M.; Hinks, T.S.C.; Howarth, P.H.; Choy, D.F.; Arron, J.R.; Wenzel, S.E.; Siddiqui, S. ACE2, TMPRSS2, and furin gene expression in the airways of people with asthma-implications for COVID-19. J. Allergy Clin. Immunol. 2020, 146, 208–211. [Google Scholar] [CrossRef]
  60. Bloom, C.I.; Cullinan, P.; Wedzicha, J.A. Asthma Phenotypes and COVID-19 Risk: A Population-based Observational Study. Am. J. Respir. Crit. Care Med. 2022, 205, 36–45. [Google Scholar] [CrossRef]
  61. Liu, S.; Zhi, Y.; Ying, S. COVID-19 and Asthma: Reflection During the Pandemic. Clin. Rev. Allergy Immunol. 2020, 59, 78–88. [Google Scholar] [CrossRef] [PubMed]
  62. Radanliev, P.; De, R.D.; Walton, R.; Van, K.M.; Montalvo, R.M.; Santos, O.; Maddox, L.; Cannady, S. COVID-19 what have we learned? The rise of social machines and connected devices in pandemic management following the concepts of predictive, preventive and personalized medicine. EPMA J. 2020, 11, 311–332. [Google Scholar] [CrossRef] [PubMed]
  63. Zhao, X.; Wang, K.; Zuo, P.; Liu, Y.; Zhang, M.; Xie, S.; Zhang, H.; Chen, X.; Liu, C. Early decrease in blood platelet count is associated with poor prognosis in COVID-19 patients-indications for predictive, preventive, and personalized medical approach. EPMA J. 2020, 11, 139–145. [Google Scholar] [CrossRef] [PubMed]
  64. Ahluwalia, P.; Ahluwalia, M.; Vaibhav, K.; Mondal, A.; Sahajpal, N.; Islam, S.; Fulzele, S.; Kota, V.; Dhandapani, K.; Baban, B.; et al. Infections of the lung: A predictive, preventive and personalized perspective through the lens of evolution, the emergence of SARS-CoV-2 and its pathogenesis. EPMA J. 2020, 11, 581–601. [Google Scholar] [CrossRef]
Figure 1. The total number of patients with respiratory signs and symptoms and the need for hospitalization and non-hospitalization, from March to August 2020, at São Paulo Hospital, São Paulo, Brazil.
Figure 1. The total number of patients with respiratory signs and symptoms and the need for hospitalization and non-hospitalization, from March to August 2020, at São Paulo Hospital, São Paulo, Brazil.
Jcm 11 04574 g001
Figure 2. (A) Variables in order of importance using the reduced predictive model using SHAP or SHAPley Additive exPlanation plot. (B) ROC curve analyses with the area under the curve of 0.9181 (95% CI, 0.9001 to 0.9361).
Figure 2. (A) Variables in order of importance using the reduced predictive model using SHAP or SHAPley Additive exPlanation plot. (B) ROC curve analyses with the area under the curve of 0.9181 (95% CI, 0.9001 to 0.9361).
Jcm 11 04574 g002
Figure 3. Summary of the study design and findings.
Figure 3. Summary of the study design and findings.
Jcm 11 04574 g003
Table 1. Comparison of demography, signs, and symptoms between patients who required hospitalization and those who did not require hospitalization from March to August 2020 at São Paulo Hospital, Brazil.
Table 1. Comparison of demography, signs, and symptoms between patients who required hospitalization and those who did not require hospitalization from March to August 2020 at São Paulo Hospital, Brazil.
VariablesNo Hospitalization (n = 6596)Hospitalization (n = 740)p
MedianInterquartile RangeMedianInterquartile Range
Females (n, %)3570 (54%) 324 (44%) <0.001
Age (year-old)39(28, 51)58(47, 69)<0.001
Duration of symptoms (days)4(2, 8)7(4, 10)<0.001
Systolic blood pressure (mmHg)133(121, 146)129(114, 145)<0.001
Diastolic blood pressure (mmHg)84(75, 94)80(70, 90)<0.001
Heart rate (bpm)90(80, 101)96(85, 110)<0.001
Temperature (°C)36.50 (36.00, 36.80)36.50(36.00, 36.90)0.007
Respiratory rate (bpm)18(16, 20)24(20, 28)<0.001
SpO2 (%)97.00(96.00, 98.00)94.00(90.00, 96.00)<0.001
Influenza vaccine 1897 (39%) 149 (47%) 0.007
Fever 2500 (42%) 289 (57%) <0.001
Fatigue1628 (27%) 184 (37%) <0.001
Sneezing496 (8.3%) 20 (4.0%) <0.001
Dry cough2702 (45%) 262 (52%) 0.004
Productive cough720 (12%) 75 (15%) 0.065
Running nose1304 (22%) 45 (8.9%) <0.001
Sore throat1428 (24%) 38 (7.6%) <0.001
Diarrhea851 (14%) 88 (17%) 0.051
Breathing difficulty1955 (33%) 302 (60%) <0.001
Anorexia614 (10%) 94 (19%) <0.001
Headache2219 (37%) 93 (18%) <0.001
Myalgia1845 (31%) 138 (27%) 0.10
Nausea/vomiting735 (12%) 91 (18%) <0.001
Wheezing136 (2.3%) 11 (2.2%) 0.9
Thoracic pain1059 (18%) 67 (13%) 0.012
Abdominal pain320 (5.4%) 34 (6.7%) 0.2
Anosmia1133 (19%) 80 (16%) 0.084
Dysgeusia1127 (19%) 80 (16%) 0.10
Chills672 (11%) 46 (9.1%) 0.14
SpO2: oxyhemoglobin saturation by pulse oximetry.
Table 2. Comparison of comorbidities between patients who required hospitalization and those who did not require hospitalization from March to August 2020 at São Paulo Hospital, Brazil.
Table 2. Comparison of comorbidities between patients who required hospitalization and those who did not require hospitalization from March to August 2020 at São Paulo Hospital, Brazil.
VariablesNo Hospitalization
(n = 6596)
Hospitalization
(n = 740)
p
Hypertension1227 (21%)232 (46%)<0.001
Cardiac disease 227 (3.8%)68 (14%)<0.001
Diabetes mellitus469 (7.8%)136 (27%)<0.001
Cerebrovascular disease42 (0.7%)17 (3.4%)<0.001
Chronic kidney disease162 (2.7%)66 (13%)<0.001
Immunosuppression230 (3.8%)55 (11%)<0.001
COPD95 (1.6%)13 (2.6%)0.10
Asthma355 (5.9%)19 (3.8%)0.044
Tuberculosis39 (0.7%)6 (1.2%)0.2
Other respiratory diseases87 (1.5%)12 (2.4%)0.10
Neoplasia89 (1.5%)30 (6.0%)<0.001
Solid organ transplant145 (2.4%)63 (12%)<0.001
Obesity305 (5.1%)56 (11%)<0.001
Smoking621 (10%)45 (8.9%)0.3
Pregnancy79 (1.3%)3 (0.6%)0.2
Previous hospitalization37 (0.6%)23 (4.5%)<0.001
COPD: Chronic obstructive pulmonary disease.
Table 3. Datasets (train and test sets) used in predicting hospitalization in patients with respiratory symptoms during the COVID-19 pandemic.
Table 3. Datasets (train and test sets) used in predicting hospitalization in patients with respiratory symptoms during the COVID-19 pandemic.
CharacteristicTrain
n = 5868
Test
n = 1468
Female sex3102 (53%)792 (54%)
Age41 (29, 53)40 (30, 54)
Symptom duration4 (2, 8)4 (2, 8)
Systolic blood pressure133 (121, 147)132 (120, 145)
Diastolic blood pressure84 (74, 94)82 (74, 92)
Heart rate90 (80, 102)89 (80, 100)
Temperature36.50 (36.00, 36.80)36.50 (36.00, 36.70)
Respiratory frequency18 (17, 20)18 (17, 20)
SpO297.00 (96.00, 98.00)97.00 (96.00, 98.00)
Influenza vaccine1632 (40%)414 (39%)
Fever2258 (44%)531 (41%)
Fatigue1456 (28%)356 (27%)
Occasional Cough420 (8.1%)96 (7.4%)
Dry cough2370 (46%)594 (46%)
Phlegm cough 641 (12%)154 (12%)
Running nose1059 (20%)290 (22%)
Sore throat1173 (23%)293 (23%)
Diarrhea733 (14%)206 (16%)
Dyspnea1822 (35%)435 (34%)
Anorexia580 (11%)128 (9.9%)
Headache1851 (36%)461 (36%)
Myalgia1597 (31%)386 (30%)
Nausea and vomiting652 (13%)174 (13%)
Chest wheezing120 (2.3%)27 (2.1%)
Chest pain889 (17%)237 (18%)
Abdominal pain295 (5.7%)59 (4.6%)
Anosmia959 (19%)254 (20%)
Dysgeusia960 (19%)247 (19%)
Chills570 (11%)148 (11%)
Hypertension1193 (23%)266 (20%)
Heart disease238 (4.6%)57 (4.4%)
Diabetes mellitus466 (9.0%)139 (11%)
Cerebrovascular disease47 (0.9%)12 (0.9%)
Chronic kidney disease188 (3.6%)40 (3.1%)
Immunosuppression231 (4.5%)54 (4.2%)
COPD84 (1.6%)24 (1.8%)
Asthma307 (5.9%)67 (5.2%)
Tuberculosis29 (0.6%)16 (1.2%)
Respiratory disease79 (1.5%)20 (1.5%)
Neoplasia97 (1.9%)22 (1.7%)
Transplant171 (3.3%)37 (2.8%)
Obesity284 (5.5%)77 (5.9%)
Smoking535 (10%)131 (10%)
Pregnant61 (1.2%)21 (1.6%)
Prior hospitalization47 (0.9%)13 (1.0%)
Hospitalization592 (10%)148 (10%)
Continuous values are present in medians and percentiles (25 and 75%). SpO2: oxyhemoglobin saturation by pulse oximetry; COPD: chronic obstructive pulmonary disease.
Table 4. Results of model performance in predicting hospitalization in patients with respiratory symptoms during the COVID-19 pandemic. The predictions were retrieved in the test set (n = 1468).
Table 4. Results of model performance in predicting hospitalization in patients with respiratory symptoms during the COVID-19 pandemic. The predictions were retrieved in the test set (n = 1468).
ModelAUC-ROC
[95% CI]
AccuracyBalanced AccuracyF1-ScorePrecision
XGBoost Full 10.927
[0.901–0.945]
0.9050.8000.9460.962
Random Forest 10.930
[0.909–0.945]
0.9050.7790.9470.957
Lasso 10.899
[0.874–0.925]
0.8440.8230.9070.974
Light GBM 10.925
[0.905–0.944]
0.8960.8220.9410.968
XgBoost Reduced 2
(RAPID RISK COVID)
0.917
[0.899–0.935]
0.8860.7930.9350.962
1 Predictors of the full model (n = 45): sex, age, duration of symptoms, systolic blood pressure, diastolic blood pressure, heart rate, temperature, respiratory frequency, SpO2, influenza vaccine, fever, fatigue, occasional cough, dry cough, phlegm cough, running nose, sore throat, diarrhea, dyspnea, anorexia, headache, myalgia, nausea and vomiting, chest wheezing, chest pain, abdominal pain, anosmia, dysgeusia, chills, hypertension, heart disease, diabetes mellitus, cerebrovascular disease, chronic kidney disease, immunosuppression, chronic obstructive pulmonary disease, asthma, tuberculosis, respiratory disease, neoplasia, transplant, obesity, smoking, pregnancy, and prior hospitalization. 2 Predictors of reduced model (n = 11): SpO2, respiratory rate, age, sex, duration of symptoms, presence of hypertension, temperature at admission, presence of DM, heart rate, and systolic and diastolic blood pressure at admission.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

De Freitas, V.M.; Chiloff, D.M.; Bosso, G.G.; Teixeira, J.O.P.; Hernandes, I.C.d.G.; Padilha, M.d.P.; Moura, G.C.; De Andrade, L.G.M.; Mancuso, F.; Finamor, F.E.; et al. A Machine Learning Model for Predicting Hospitalization in Patients with Respiratory Symptoms during the COVID-19 Pandemic. J. Clin. Med. 2022, 11, 4574. https://doi.org/10.3390/jcm11154574

AMA Style

De Freitas VM, Chiloff DM, Bosso GG, Teixeira JOP, Hernandes ICdG, Padilha MdP, Moura GC, De Andrade LGM, Mancuso F, Finamor FE, et al. A Machine Learning Model for Predicting Hospitalization in Patients with Respiratory Symptoms during the COVID-19 Pandemic. Journal of Clinical Medicine. 2022; 11(15):4574. https://doi.org/10.3390/jcm11154574

Chicago/Turabian Style

De Freitas, Victor Muniz, Daniela Mendes Chiloff, Giulia Gabriella Bosso, Janaina Oliveira Pires Teixeira, Isabele Cristina de Godói Hernandes, Maira do Patrocínio Padilha, Giovanna Corrêa Moura, Luis Gustavo Modelli De Andrade, Frederico Mancuso, Francisco Estivallet Finamor, and et al. 2022. "A Machine Learning Model for Predicting Hospitalization in Patients with Respiratory Symptoms during the COVID-19 Pandemic" Journal of Clinical Medicine 11, no. 15: 4574. https://doi.org/10.3390/jcm11154574

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop