Next Article in Journal
Prevalence and Estimation of the Evolution of Myopia in Spanish Children
Previous Article in Journal
Safety and Feasibility of the BYCROSS® Atherectomy Device for the Treatment of Femoropopliteal Arterial Obstructions: Single-Center Short-Term Outcomes
 
 
jcm-logo
Article Menu
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting the Length of Mechanical Ventilation in Acute Respiratory Disease Syndrome Using Machine Learning: The PIONEER Study

by
Jesús Villar
1,2,3,4,*,
Jesús M. González-Martín
1,2,
Cristina Fernández
2,
Juan A. Soler
5,
Alfonso Ambrós
6,
Lidia Pita-García
7,
Lorena Fernández
8,
Carlos Ferrando
1,9,
Blanca Arocas
10,
Myriam González-Vaquero
11,
José M. Añón
1,12,
Elena González-Higueras
13,
Dácil Parrilla
14,
Anxela Vidal
15,
M. Mar Fernández
16,
Pedro Rodríguez-Suárez
1,17,
Rosa L. Fernández
1,2,
Estrella Gómez-Bentolila
2,
Karen E. A. Burns
3,18,19,
Tamas Szakmany
20,
Ewout W. Steyerberg
21 and
the PredictION of Duration of mEchanical vEntilation in ARDS (PIONEER) Network
add Show full author list remove Hide full author list
1
CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, 28029 Madrid, Spain
2
Research Unit, Hospital Universitario Dr. Negrín, 35019 Las Palmas de Gran Canaria, Spain
3
Li Ka Shing Knowledge Institute at St. Michael’s Hospital, Toronto, ON M5B 1W8, Canada
4
Faculty of Health Sciences, Universidad del Atlántico Medio (UNAM), 35017 Tafira Baja, Gran Canaria, Canary Islands, Spain
5
Intensive Care Unit, Hospital Universitario Virgen de Arrixaca, 30120 Murcia, Spain
6
Intensive Care Unit, Hospital General Universitario de Ciudad Real, 13005 Ciudad Real, Spain
7
Intensive Care Unit, Hospital Universitario de A Coruña, 15006 La Coruña, Spain
8
Intensive Care Unit, Hospital Universitario Río Hortega, 47012 Valladolid, Spain
9
Surgical Intensive Care Unit, Department Anesthesia, Hospital Clinic, IDIBAPS, 08036 Barcelona, Spain
10
Department of Anesthesia, Hospital Clínico Universitario de Valencia, 46010 Valenci, Spain
11
Intensive Care Unit, Complejo Asistencial Universitario de León, 24001 León, Spain
12
Intensive Care Unit, Hospital Universitario La Paz, IdiPaz, 28046 Madrid, Spain
13
Intensive Care Unit, Hospital Virgen de La Luz, 16002 Cuenca, Spain
14
Intensive Care Unit, Hospital Universitario NS de Candelaria, 38010 Santa Cruz de Tenerife, Spain
15
Intensive Care Unit, Hospital Universitario Fundación Jiménez Díaz, 28040 Madrid, Spain
16
Intensive Care Unit, Hospital Universitario Mutua Terrassa, 08221 Terrassa, Spain
17
Thoracic Surgery, Hospital Universitario Dr. Negrín, 35019 Las Palmas de Gran Canaria, Spain
18
Critical Care Medicine, Unity Health Toronto-St. Michael’s Hospital, Toronto, ON M5B 1W8, Canada
19
Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON L8S 4L8, Canada
20
Intensive Care, Cardiff University, Cardiff CF14 4XW, UK
21
Department Biomedical Data Sciences, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands
*
Author to whom correspondence should be addressed.
Members of the PIONEER Project are listed in the Supplementary Materials.
J. Clin. Med. 2024, 13(6), 1811; https://doi.org/10.3390/jcm13061811
Submission received: 2 February 2024 / Revised: 19 March 2024 / Accepted: 19 March 2024 / Published: 21 March 2024

Abstract

:
Background: The ability to predict a long duration of mechanical ventilation (MV) by clinicians is very limited. We assessed the value of machine learning (ML) for early prediction of the duration of MV > 14 days in patients with moderate-to-severe acute respiratory distress syndrome (ARDS). Methods: This is a development, testing, and external validation study using data from 1173 patients on MV ≥ 3 days with moderate-to-severe ARDS. We first developed and tested prediction models in 920 ARDS patients using relevant features captured at the time of moderate/severe ARDS diagnosis, at 24 h and 72 h after diagnosis with logistic regression, and Multilayer Perceptron, Support Vector Machine, and Random Forest ML techniques. For external validation, we used an independent cohort of 253 patients on MV ≥ 3 days with moderate/severe ARDS. Results: A total of 441 patients (48%) from the derivation cohort (n = 920) and 100 patients (40%) from the validation cohort (n = 253) were mechanically ventilated for >14 days [median 14 days (IQR 8–25) vs. 13 days (IQR 7–21), respectively]. The best early prediction model was obtained with data collected at 72 h after moderate/severe ARDS diagnosis. Multilayer Perceptron risk modeling identified major prognostic factors for the duration of MV > 14 days, including PaO2/FiO2, PaCO2, pH, and positive end-expiratory pressure. Predictions of the duration of MV > 14 days showed modest discrimination [AUC 0.71 (95%CI 0.65–0.76)]. Conclusions: Prolonged MV duration in moderate/severe ARDS patients remains difficult to predict early even with ML techniques such as Multilayer Perceptron and using data at 72 h of diagnosis. More research is needed to identify markers for predicting the length of MV. This study was registered on 14 August 2023 at ClinicalTrials.gov (NCT NCT05993377).

1. Introduction

Acute respiratory distress syndrome (ARDS) is a type of acute hypoxemic respiratory failure associated with lung injury and impaired gas exchange [1,2]. Mechanical ventilation (MV) is vital for most patients with moderate-to-severe ARDS managed in intensive care units (ICUs), although MV for long periods of time may induce lung injuries and infection [3,4]. The number of MV days is a major driver of high healthcare costs in managing critically ill patients [5]. Hence, a decrease in MV duration is an actionable research goal in ARDS management. The successful prediction of MV duration may impact several clinical decisions, such as the initiation of oral feeding, timing of performing a tracheostomy, transferring to long-term ventilation facilities, alignment with patients’ goals of care, or enrollment into clinical trials [6,7,8].
The prediction of the length of MV is important [6,7,9]. The mortality rate of patients requiring prolonged MV is high [9]. The accuracy of clinicians in predicting MV duration is very limited [6]. Usually, clinicians integrate multiple clinical features that are not well elaborated and make implicit assessments about the possible duration of MV, which are translated into routine medical practice with a wide margin of error [6]. Using logistic regression models, predictions of the duration of MV provide moderate levels of overall accuracy in critically ill patients and are insufficient to assist in clinical decisions [7,9,10]. Patients and clinicians require clinical prediction models to guide healthcare decisions in an evidence-based and personalized manner.
MV duration in ARDS is dependent on both ICU factors and patient-related factors. Advanced machine learning (ML) methods hold promise for enhancing accuracy in predicting MV duration [11,12]. ML is an exploratory process in which algorithms extract knowledge from the data provided. ML flexibility detects relationships between potential clinical features, physiologic parameters, and an outcome [13]. Despite extensive modeling and a large number of clinically relevant features, the discrimination using ML approaches for predicting responders to the prone position in mechanically ventilated patients with COVID-19 [14] was very poor. Scarce studies have evaluated the role of ML in predicting the duration of MV in ARDS patients [11,15,16,17]. Our primary goal was to compare the performance of logistic regression and three powerful ML approaches for the development, testing, and external validation of a model to predict the duration of MV > 14 days after diagnosis of moderate/severe ARDS. We hereto used a relatively large population of patients with moderate/severe ARDS under three different scenarios over time.

2. Methods

Approval of this study was permitted by the Ethics Committee at Hospital Universitario Dr. Negrin, Spain (CEI/CEIm #2021-321-1). Informed consent was waived based on Royal Decrees 1090/2015 and 957/2020 under the Spanish legislation for biomedical research, based on the retrospective nature of this comprehensive analysis, the anonymization/dissociation of data, and no potential harm or benefit to the patients. This study respected the TRIPOD (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis) guidelines for reporting prediction models [18].

2.1. Patient Population and Study Design

This study is an extension of the Spanish SIESTA program [19,20] (Supplementary Materials). We performed a comprehensive analysis, termed the “PredictION of the duration of mEchanical vEntilation in aRds” (PIONEER) study on an unrestricted dataset from 1303 adult (>17 years) patients with moderate/severe ARDS [21] managed with lung-protective MV in a network of Spanish ICUs (Supplemental Materials). None of our ARDS patients had COVID-19. This study was performed in three steps. For the first two steps (model development and testing), we analyzed 1000 patients included in three prospective, multicenter, observational studies (see Supplemental Materials), admitting consecutive patients meeting the definition of moderate/severe ARDS [21]. In the third step, we tested the model performance in an independent cohort of moderate/severe ARDS patients [20] for reliable external validation [22,23].
For the purpose of this study and to avoid selection bias, we only analyzed patients with data from the first 3 ICU days on MV after the diagnosis of moderate/severe ARDS: data captured at the time of diagnosis (T0), data captured at 24 h (T24), and data captured at 72 h (T72). As a result, we excluded patients on MV < 3 days as follows: 80 patients from 1000 patients were included in the development/testing cohort, 50 patients from 303 patients were included in the validation cohort and finally analyzed data from 1173 patients. T0 was defined as the time and day in which the patient first met moderate/severe ARDS criteria, irrespective of the day of ICU admission or the initiation of MV, as mandated by the Berlin definition [21]. All patients had arterial blood gases at the study’s inclusion (Supplementary Materials). At T24, the values of gas-exchange and lung mechanics variables [including PaO2, PaCO2, PaO2/FiO2, and inspiratory plateau pressure (Pplat), among others] were evaluated under standardized ventilator settings [positive end-expiratory pressure (PEEP) of 10 cm H2O and FiO2 of 0.5)] [20]. When patients required PEEP > 10 or FiO2 > 0.5 and could not tolerate a reduction in PEEP or FiO2, the rules for setting PEEP and FiO2 were applied during the standardized evaluation, as validated by our group [24,25]. At other times, PEEP and FiO2 were set at the discretion of attending clinicians. We did not collect data from day 2. For T72, we used representative data at 72 h after the diagnosis of moderate/severe ARDS. We only included patients with moderate/severe ARDS. We excluded patients < 18 years old with acute heart failure, severe chronic pulmonary disease, do-not-resuscitate orders, brain death, or patients on MV for <24 h (see Supplementary Materials).

2.2. Variables and Outcomes

The selection of potentially clinically relevant variables was based on prior studies [25,26,27,28]. We included demographics, comorbidities, ventilator settings, lung mechanics [respiratory rate (RR), tidal volume (VT) as mL/kg predicted body weight (PBW), PEEP, Pplat, driving pressure (calculated as Pplat minus PEEP)] and gas-exchange (FiO2, PaO2, PaCO2, PaO2/FiO2, pH) at T0, T24, and T72. We also recorded the Acute Physiology and Chronic Health Evaluation II (APACHE II) [29] score during the first 24 h of ARDS diagnosis, the Sequential Organ Failure Assessment (SOFA) score [30], and the prevalence of extrapulmonary organ failures (OFs) included in the SOFA scale (Supplementary Materials). In each patient, we recorded data from 165 variables during their ICU stay (Supplementary Materials). We collected the date and status (alive/dead) of patients in the ICU and at hospital discharge.
We examined the performance of each method at T0, T24, and T72. For the purpose of our study, prolonged MV was defined as being ventilated for more than 14 days after the diagnosis of moderate/severe ARDS [6], independently of the number of patients who died between day 3 and day 14.

2.3. Predefined Rules and Statistical Analysis

We focused on variables collected during the first 3 MV days after the diagnosis of moderate/severe ARDS to estimate the probability of MV duration > 14 days, independent of the cause of death or the underlying disease (Figure S1 and Table S1). For variable selection, our aim was to identify clinically relevant variables while avoiding redundant variables. We first analyzed the following features as potential predictors of prolonged MV: age, sex, comorbidities, SOFA score, number of extrapulmonary organ failures, PaO2, PaO2/FiO2, PaCO2, pH, FiO2, VT, RR, PEEP, Pplat, and driving pressure. To avoid multicollinearity [31], we performed matrices at T0, T24, and T72 to assess the correlation between variables in the dataset [32,33]. We also performed a principal component analysis to summarize the information content that could be easily visualized and analyzed [33,34] (Figures S3 and S4, Supplementary Materials).
We specified an a priori statistical analysis plan (Supplementary Materials). We identified potential variables that could be considered in the prediction model based on predefined rules and the contribution to the area under the receiver operating characteristic curve (AUC) in relation to MV duration. We assessed differences in the values of clinically relevant features at T0, T24, and T72 and across the development/testing cohort and the external validation cohort. We reported the odds ratio and 95% confidence intervals (CIs) and standard classification metrics (sensitivity, specificity, false positives, and false negatives) for each model.

Feature Selection Method

Since the inclusion of all available variables in ML can lead to complex models that are difficult to interpret, we screened variables employing a genetic algorithm (GA) variable selection method [34] to achieve parsimony and identify a subset of relevant variables while excluding noise/redundant variables. GA variable selection is a technique that helps to identify a subset of the measured variables that are, for a given problem, the most useful for a precise and accurate regression model. Although many variables may be of use in prediction, several considerations may preclude measuring all the variables originally considered for a prediction model. It is useful to identify a subset of variables that allow sufficient prediction accuracy and precision while minimizing the number of variables to be measured. GAs provide a straightforward method based on a “survival of the fittest” approach to modeling data. GAs create random populations of artificial individuals that are evaluated by a mathematical fitness function and have been successfully applied to solve optimization problems, both for continuous and discrete functions (more details in Supplementary Materials). Our findings indicate that, for the purpose of our study, the GA approach is appropriate for finding an efficient subset of variables for combinations that are optimal for solving high-dimensional classification problems. The duration of MV can be treated as a classification problem. The selection of an optimized set of variables in our three early scenarios (T0, T24, T72) is key to the PIONEER study for predicting the prolonged duration of MV, especially when the search is large, complex or poorly understood, as in the setting of moderate/severe ARDS. We applied GA to optimize the subset of selected variables by reducing the Akaike information criterion (AIC) and the Bayesian information criterion (BIC) [35]. Lower values of both criteria are preferable.
We performed statistical analysis using R (version 4.3.1, R Foundation for Statistical Computing, Vienna, Austria). A two-sided p < 0.005 was considered a real effect size [36].
We built the PIONEER prediction model by considering the lowest number of variables obtained by GA and providing the best performance of the three scenarios in the development database. We optimized the quality of models based on a 5-fold cross-validation approach for randomly splitting the development/training dataset and repeated this process 100 times (Supplementary Materials). We assessed the final lowest number of variables model using logistic regression and three supervised ML algorithms, Random Forest, Support Vector Machine, and Multilayer Perceptron [37,38], to generate prediction models for MV duration > 14 days after moderate/severe ARDS diagnosis. We also assessed the validity of the prediction models according to calibration and discrimination in an external validation cohort [39,40] (Supplementary Materials).
Figure 1 and Figure S2 summarize the study design.

3. Results

The baseline demographics, etiology, degree of severity, and outcome data of our patient population are reported in Table S2. The most common etiologies were pneumonia, sepsis, aspiration, and trauma in both cohorts (development and external validation). Most patients had moderate ARDS based on stratification by the Berlin criteria at T0. A total of 1173 patients [920 patients from the derivation cohort and 253 patients from the validation cohort] (Table 1) received MV for ≥3 days after moderate/severe ARDS diagnosis (Supplementary File), and their ICU mortality rate was similar (307/920, 33% vs. 77/253, 30%, respectively, p = 0.420) (Table S3). A total of 49% (441/920) patients from the derivation cohort and 40% (100/253) patients from the validation cohort were mechanically ventilated for >14 days [median 14 days (IQR 8–25) vs. 13 days (IQR 7–21), respectively]. A similar ICU mortality rate was found in patients ventilated for 3–14 days and for >14 days [155/479 (32%) vs. 152/441 (35%)] in the derivation cohort (Table 2 and Table S3).
From twenty clinically relevant variables collected at T0, T24, and T72 (Table S1), four variables were excluded for multicollinearity as follows: the PaO2, FiO2, SOFA score, and driving pressure. (Figure S3 and Tables S4–S6, Supplementary Results). Few characteristics remained associated with the duration of MV > 14 days in multivariable logistic regression analysis. The performance of the model with all 16 clinical variables had a cross-validated AUC of 0.61 at T0, 0.61 at T24, and 0.66 at T72. Principal component analysis showed that patients differed more at T72 than at T0 or T24 (Figure S4). When applying GA for variable selection and optimizing AIC and BIC (for both criteria, lower values are preferred), the resulting models reduced the number of predictors from 16 to 9, 6, 11, and 4 variables, respectively (Tables S7–S10).
Among the variables with higher importance for predicting the duration of MV > 14 days at T72 were PaO2/FiO2, PaCO2, pH, and PEEP (Figure S5). The performance of ML methods and logistic regression for the scenarios in the development database is shown in Table 3. Multilayer Perceptron (MLP) provided the highest AUC values, and data collected at T72 provided better performance than data at T0 or T24.
Starting from 11 variables in the PIONEER model, the MLP technique was most promising for predicting MV duration > 14 days using data at T72 with the MLP technique (AUC 0.71, 95%CI 0.65–0.76). Models developed at one time period were not transferable to other time periods. The external validation of the best prediction models at T72 obtained from the derivation cohort for the duration of MV prediction is reported in Table S11. The ML models had poor calibration, implying the poor reliability of absolute risk predictions (Figure S6).

4. Discussion

In our study, the MLP technique predicted prolonged MV duration in patients with moderate-to-severe ARDS with modest accuracy. As expected, data captured at the baseline or at T24 were less predictive than data obtained at T72. Our findings highlight 4 out of 11 clinical features collected at T72 (Table S9) as follows: PaO2/FiO2, PaCO2, pH, and PEEP. Although these variables have been recognized as risk factors of prolonged MV, our study illustrates that absolute risk prediction based on these variables remains limited.
The PIONEER prediction model could contribute to addressing some of these risk features for reducing the duration of MV in moderate/severe ARDS. We recognize that MV duration in ARDS is also dependent on ICU factors. Our study confirms that it is difficult to predict the time of ending MV in moderate/severe ARDS, even with modern ML techniques. Evolving clinical practice has led to the clinical use of non-invasive ventilation in mild and moderate ARDS [1,41]. Furthermore, organizational culture, staffing pressure, and various quality improvement initiatives implementing evidence-based management could be more important than patient-related factors. Our multicenter study cannot answer these issues since we would need data on all ICU-related factors, which are contextual, variable, and probably do not exist in an interpretable format. Given the available data, patients who would benefit the most from “actionable” timely therapeutic interventions or the early warning of possible adverse events [42] are those with the highest risk for prolonged MV duration [43]. The current ARDS framework [2] limits our ability to fully determine which patients need MV for >14 days. We do not know whether updating the ARDS definition addresses any of the existing limitations related to other aspects impacting ARDS recognition and management. A more refined prediction model that captures high-risk patients and ICU-related features in a timely manner could be beneficial. This would have the potential to guide clinicians in therapeutic choices in specific ARDS groups.
The early assessment of MV duration > 14 days may be important for evidence-based interventions that accelerate MV discontinuation [44]. Our findings suggest that MV duration only can be modestly predicted as early as T72. The selection of patients with a high risk of prolonged MV might affect significant clinical decisions, including referral to other centers, transfers to long-term ventilator units, the timing of performing a tracheostomy [8] or alignment with goals of care [6]. In addition, an early prediction model for the prolonged duration of MV could optimize ICU resource use [45]. Our model could determine with modest accuracy if a patient remains intubated after 14 days of ARDS diagnosis using commonly accessible variables during the first three MV days after diagnosis. In general, the requisite for a successful ML application is the presence of “useful patterns” in the data. In the current environment, it is far from optimized (if not unrealistic) to enable ML to provide a precise early prediction of prolonged MV duration in ARDS for clinicians. A comparison with previous ML studies on MV duration is difficult because investigators used different ML techniques, different timeframes, different ML metrics, or different populations [5,9,23,46]. Since the definition of prolonged MV is inconsistent, the performance assessment of related prediction models is not applicable to all situations [47]. We believe that using input criteria that maximize the measurement of variables commonly available to clinicians and minimize subjective clinician judgement is a preferable strategy.
Clinicians are interested in actionable and modifiable variables for improving outcomes. It seems unrealistic that ventilated moderate/severe ARDS patients can be weaned quickly. We think that the following targets are reasonable within the first three days and beyond ARDS management to decrease MV duration as follows: improving oxygenation for PaO2/FiO2 > 150 mmHg [48], the use of high PEEP [49] while keeping Pplat < 29 cmH2O [50], and aiming to reduce extrapulmonary organ dysfunction [51]. Of note, the best strategies to reduce MV duration in patients with ARDS have not been identified yet and require further research.
Of note, there is divergence in the use of invasive MV across countries [52], and patient characteristics vary markedly. This divergence highlights the need to better understand patient-, clinician-, and system-level choices associated with the use of MV. The impact of higher PEEP levels on ARDS has long been debated [53]. Available evidence suggests that high PEEP could be beneficial for moderate/severe ARDS. In the ALIVE epidemiological study [54], ICU mortality increased as the PaO2/FiO2 ratio was <150 mmHg and a pH ≤ 7.30. Permissive hypercapnia is sporadically required to allow lung-protective MV. After the induction of hypercapnia, pH decreases markedly and gradually moves toward normalization at 72 h. The clinical effects of hypercapnia are conflicting since metabolic acid-base adaptation triggered by hypercapnia is a complex process. pH compensation makes the VT reduction more acceptable in ARDS; however, concurrent infections could influence the metabolic adaptation to hypercapnia [55]. In a recent systematic review [56], permissive hypercapnia was associated with increased survival, although hypercapnia was imposed under lung-protective MV, and it was associated with a worse outcome. Changes in pH, PaCO2 or the PaO2/FiO2 ratio are strong predictors for disease progression in ARDS [57]. PaCO2 value is required for the calculation of alveolar dead space. We did not include dead space in the PIONEER model (end-tidal CO2 was not regularly measured in our patients throughout the first three days of MV), but we acknowledge that temporal variations in dead space during MV are associated with ICU outcomes [58] in addition to the early time-course of gas exchange in ARDS [59].
The strengths of our study include the fact that we analyzed a large number of patients from a multicenter network, reflecting the current clinical practice and contemporary case mix in moderate/severe ARDS. Starting from prior clinical knowledge represents an important strength leading to clinically relevant interpretations that are less likely to generate unreasonable predictions [60]. The use of general practices in ARDS across ICUs provides comparative data to support conclusions about the PIONEER model, allowing the potential utility to improve timely clinical interventions for ARDS. On the other hand, we also acknowledge some limitations of our study. First, we only investigated patients from the Spanish healthcare system managed with lung-protective MV; hence, the data might not be valid in other settings or in patients ventilated with large VT. Second, we ignored whether the model worked for each etiology, although they were manifested by the same syndrome. We felt that minimal exclusion criteria and a relatively large sample size attenuated these concerns. Third, our data were collected before the COVID-19 pandemic, and it is plausible that ARDS ventilator management might have changed since then [1]. Fourth, we trained our algorithms using only three ML techniques; others might lead to better prediction models. Fifth, we did not collect precise information on factors associated with weaning difficulties, such as sedatives, neuromuscular blocking agents, fluid balance, the level of consciousness, secondary infections, and muscle function. Sixth, our definition of prolonged MV excluded those who died before 14 days. Finally, the ML methods provided poorly calibrated predictions, implying that absolute risk predictions are not reliable for individual patients.

5. Conclusions

We confirmed that predicting MV duration in ARDS is complex, even with data captured at T72 compared to the baseline or at T24. ML techniques are promising but insufficient to provide reliable patient-level predictions. Clinical determinants of MV duration in ARDS are multifactorial and should be studied further to support the timely management and treatment of ARDS, focusing on optimizing gas exchange, lung strain, and variations in pH secondary to the natural evolution of the disease process. More research is needed to identify clinical and organizational variables that might help to recognize patients who are likely to have prolonged MV.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jcm13061811/s1.

Author Contributions

J.V., J.M.G.-M., E.W.S., K.E.A.B. and T.S., contributed to the initial study concept and design. J.V., J.M.G.-M., J.M.A., C.F. (Carlos Ferrando) and P.R.-S. obtained funding for the study. All authors contributed to the final study design, or participated in its coordination, or in drafting the first manuscript. J.A.S., A.A., L.P.-G., L.F., C.F. (Carlos Ferrando), B.A., J.M.A., M.G.-V., E.G.-H., D.P., A.V. and M.M.F. enrolled patients into the study and participated in the data collection and data analysis. J.V., C.F. (Cristina Fernández), R.L.F., E.G.-B., J.M.G.-M., K.E.A.B., E.W.S. and T.S. are responsible for data analysis and/or interpretation of data. J.V., C.F. (Cristina Fernández), R.L.F., E.G.-B. and J.M.G.-M. had full access to all data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. J.V., J.M.G.-M., C.F. (Cristina Fernández), C.F. (Carlos Ferrando), J.M.A., E.W.S., K.E.A.B. and T.S. participated in the final version of this manuscript. J.V., J.M.G.-M. and T.S. contributed to the final revised article. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the Instituto de Salud Carlos III, Madrid, Spain (PI19/0141), The European Regional Development’s Funds (FEDER), Fundación Canaria Instituto de Investigación Sanitaria de Canarias, Spain (PIFIISC20-51, PIFIISC21-36), and Asociación Científica Pulmón y Ventilación Mecánica, Spain. J. Villar, J.M. Añón, C. Ferrando, R.L. Fernández, J.M. González-Martín received grant support from the Instituto de Salud Carlos III, Madrid, Spain (CB06/06/1088).

Institutional Review Board Statement

Approval of this study was given by the Ethics Committees of Hospital Universitario Dr. Negrín (Las Palmas de Gran Canaria, Spain). Requirement for informed consent was waived (Reference CEI/CEIm 2021-321-1) under the Royal Decrees 1090/2015 and 957/2020 based on the Spanish legislation for biomedical research due to the retrospective nature of the secondary analysis, the anonymization/dissociation of data, and no harm and no benefit to managing the patients. This was a comprehensive analysis using unrestricted data from our previously published studies in patients with moderate-to-severe ARDS that were approved by the referral Ethics Committees of Hospital Universitario Dr. Negrín (Las Palmas de Gran Canaria, Spain), Hospital Virgen de La Luz (Cuenca, Spain), Hospital Clínico Universitario (Valladolid, Spain), Hospital Universitario La Paz (Madrid, Spain), and Hospital Clínico de Valencia (Valencia, Spain), and adopted by all participating centers. This study was conducted in accordance with the fundamental principles established in the Declaration of Helsinki, the Convention of the European Council related to human rights and biomedicine, the Ethical Guidelines for Health-Related Research Involving Humans by the Council for International Organization of medical Sciences of the World Health Organization (WHO), and within the requirements established by the Spanish legislation for biomedical research, the protection of personal data, and bioethics.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data needed to evaluate the conclusions in this article are presented and tabulated in the main text or the Supplemental Materials. Data are available from the corresponding author on reasonable request.

Acknowledgments

The authors would like to recognize Robert M. Kacmarek (now deceased) for his contribution to the initial study concept.

Conflicts of Interest

All authors declare no competing interests. None of the clinical investigators received any honorarium for participating in the study.

References

  1. Grasselli, G.; Calfee, C.S.; Camporota, L.; Poole, D.; Amato, M.B.; Antonelli, M.; Arabi, Y.M.; Baroncelli, F.; Beitler, J.R.; Bellani, G.; et al. ESICM guidelines on acute respiratory distress syndrome: Definition, phenotyping and respiratory support strategies. Intensive Care Med. 2023, 49, 727–759. [Google Scholar] [CrossRef]
  2. Maslove, D.M.; Tang, B.; Shankar-Hari, M.; Lawler, P.R.; Angus, D.C.; Baillie, J.K.; Baron, R.M.; Bauer, M.; Buchman, T.G.; Calfee, C.S.; et al. Redefining critical illness. Nature Med. 2022, 28, 1141–1148. [Google Scholar] [CrossRef] [PubMed]
  3. Slutsky, A.S.; Ranieri, V.M. Ventilator-induced lung injury. N. Engl. J. Med. 2013, 389, 2126–2136. [Google Scholar] [CrossRef] [PubMed]
  4. Hybmayr, R.D.; Burchardi, H.; Elliot, M.; Fessler, H.; Georgopoulos, D.; Jubran, A.; Limper, A.; Pesenti, A.; Rubenfeld, G.; Stewart, T.; et al. Statement of the 4th International Conference in Critical Care on ICU-acquired pneumonia-Chicago, Illinois, May 2002. Intensive Care Med. 2002, 28, 1521–1536. [Google Scholar] [CrossRef]
  5. Dasta, J.F.; McLaughlin, T.P.; Mody, S.H.; Piech, C.T. Daily cost of an intensive care unit day: The contribution of mechanical ventilation. Crit. Care Med. 2005, 33, 1266–1271. [Google Scholar] [CrossRef]
  6. Figueroa-Casas, J.B.; Connery, S.M.; Montoya, R.; Dwivedi, A.K.; Lee, S. Accuracy of Early Prediction of Duration of Mechanical Ventilation by Intensivists. Ann. Am. Thorac. Soc. 2014, 11, 182–185. [Google Scholar] [CrossRef]
  7. Figueroa-Casas, J.B.; Dwivedi, A.K.; Connery, S.M.; Quansah, R.; Ellerbrook, L.; Galvis, J. Predictive models of prolonged mechanical ventilation yield moderate accuracy. J. Crit. Care 2015, 30, 502–505. [Google Scholar] [CrossRef]
  8. Raimondi, N.; Vial, M.R.; Calleja, J.; Quintero, A.; Cortés, A.; Celis, E.; Pacheco, C.; Ugarte, S.; Añón, J.M.; Hernández, G.; et al. Evidence-based guidelines for the use of tracheostomy in critically ill patients. J. Crit. Care 2016, 38, 304–318. [Google Scholar] [CrossRef] [PubMed]
  9. Troché, G.; Moine, P. Is the duration of mechanical ventilation predictable? Chest 1997, 112, 745–751. [Google Scholar] [CrossRef]
  10. Añón, J.; Gómez-Tello, V.; González-Higueras, E.; Oñoro, J.; Córcoles, V.; Quintana, M.; López-Martínez, J.; Marina, L.; Choperena, G.; García-Fernández, A.; et al. Prolonged mechanical ventilation probability model. Med. Intensive 2012, 36, 488–495. [Google Scholar] [CrossRef]
  11. Sayed, M.; Riaño, D.; Villar, J. Predicting duration of mechanical ventilation using supervised machine learning. J. Clin. Med. 2021, 10, 3824. [Google Scholar] [CrossRef] [PubMed]
  12. Vali, M.; Paydar, S.; Self, M.; Sabetian, G.; Abujaber, A.; Ghaem, H. Prediction prolonged mechanical ventilation in trauma patients of the intensive care units according to initial medical factors: A machine learning approach. Sci. Rep. 2023, 13, 5925. [Google Scholar] [CrossRef] [PubMed]
  13. Gutierrez, G. Artificial Intelligence in the Intensive Care Unit. Crit. Care 2020, 24, 101. [Google Scholar] [CrossRef] [PubMed]
  14. Dam, T.A.; Roggeveen, L.F.; van Diggelen, F.; Fleuren, L.M.; Jagesar, A.R.; Otten, M.; de Vries, H.J.; Gommers, D.; Cremer, O.L.; Bosman, R.J.; et al. Predicting responders to prone positioning in mechanically ventilated patients with COVID-19 using machine learning. Ann. Intensive Care 2022, 12, 99. [Google Scholar] [CrossRef] [PubMed]
  15. Parreco, J.; Hidalgo, A.; Parks, J.J.; Kozol, R.; Rattan, R. Using artificial intelligence to predict prolonged mechanical ventilation and tracheostomy placement. J. Surg. Res. 2018, 228, 179–187. [Google Scholar] [CrossRef] [PubMed]
  16. Ding, X.F.; Li, J.B.; Liang, H.Y.; Wang, Z.Y.; Jiao, T.T.; Liu, Z.; Yi, L.; Bian, W.S.; Wang, S.P.; Zhu, X.; et al. Predictive model for acute respiratory distress syndrome events in ICU patients in China using machine learning algorithms: A secondary analysis of a cohort study. J. Trans. Med. 2019, 17, 326. [Google Scholar] [CrossRef] [PubMed]
  17. Wang, Z.; Zhang, L.; Huang, T.; Yang, R.; Cheng, H.; Wang, H.; Yin, H.; Lyu, J. Developing an explainable machine learning model to predict the mechanical ventilation duration of patients with ARDS in intensive care units. Hear. Lung 2023, 58, 74–81. [Google Scholar] [CrossRef]
  18. Collins, G.S.; Reitsma, J.B.; Altman, D.G.; Moons, K.G. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement. J. Clin. Epidemiol. 2015, 68, 112–121. [Google Scholar] [CrossRef]
  19. Villar, J.; Pérez-Méndez, L.; Blanco, J.; Añón, J.M.; Blanch, L.; Belda, J.; Santos-Bouza, A.; Fernández, R.L.; Kacmarek, R.M.; Spanish Initiative for Epidemiology, Stratification and Therapies for ARDS (SIESTA) network. A universal definition of ARDS: The PaO2/FiO2 under a standard ventilator setting—A prospective, multicenter, validation study. Intensive Care Med. 2013, 39, 583–592. [Google Scholar] [CrossRef]
  20. Villar, J.; Mora-Ordoñez, J.M.; Soler, J.A.; Mosteiro, F.; Vidal, A.; Ambrós, A.; Fernández, L.; Murcia, I.; Civantos, B.; Romera, M.A.; et al. The PANDORA Study: Prevalence and Outcome of Acute Hypoxemic Respiratory Failure in the Pre-COVID-19 Era. Crit. Care Explor. 2022, 4, e0684. [Google Scholar] [CrossRef]
  21. ARDS Definition Task Force; Ranieri, V.M.; Rubenfeld, G.D.; Thompson, B.T.; Ferguson, N.; Caldwell, E.; Fan, E.; Camporota, L.; Slutsky, A.S. Acute respiratory distress syndrome. The Berlin definition. JAMA 2012, 307, 2526–2533. [Google Scholar]
  22. Vergouwe, Y.; Steyerberg, E.W.; Eijkemans, M.J.; Habbema, J.D.F. Substantial effective sample sizes were required for external validation studies of predictive logistic regression models. J. Clin. Epidemiol. 2005, 58, 475–483. [Google Scholar] [CrossRef]
  23. Leisman, D.E.B.; Harhay, M.O.; Lederer, D.J.; Abramson, M.M.; Adjei, A.A.; Bakker, J.M.; Ballas, Z.K.; Barreiro, E.; Bell, S.C.M.; Bellomo, R.; et al. Development and Reporting of Prediction Models: Guidance for Authors from Editors of Respiratory, Sleep, and Critical Care Journals. Crit. Care Med. 2020, 48, 623–633. [Google Scholar] [CrossRef]
  24. Villar, J.; Pérez-Méndez, L.; López, J.; Belda, J.; Blanco, J.; Saralegui, I.; Suárez-Sipmann, F.; López, J.; Lubillo, S.; Kacmarek, R.M.; et al. An Early PEEP/FiO2 Trial Identifies Different Degrees of Lung Injury in Patients with Acute Respiratory Distress Syndrome. Am. J. Respir. Crit. Care Med. 2007, 176, 795–804. [Google Scholar] [CrossRef] [PubMed]
  25. Villar, J.; Ambrós, A.; Mosteiro, F.; Martínez, D.; Fernández, L.; Ferrando, C.; Carriedo, D.; Soler, J.A.; Parrilla, D.; Hernández, M.; et al. A Prognostic Enrichment Strategy for Selection of Patients with Acute Respiratory Distress Syndrome in Clinical Trials. Crit. Care Med. 2019, 47, 377–385. [Google Scholar] [CrossRef] [PubMed]
  26. Dai, Q.; Wang, S.; Liu, R.; Wang, H.; Zheng, J.; Yu, K. Risk factors for outcomes of acute respiratory distress syndrome patients: A retrospective study. J. Thorac. Dis. 2019, 11, 673–685. [Google Scholar] [CrossRef] [PubMed]
  27. Le, S.; Pellegrini, E.; Green-Saxena, A.; Summers, C.; Hoffman, J.; Calvert, J.; Das, R. Supervised machine learning for the early prediction of acute respiratory distress syndrome (ARDS). J. Crit. Care 2020, 60, 96–102. [Google Scholar] [CrossRef] [PubMed]
  28. Villar, J.; González-Martín, J.M.; Hernández-González, J.; Armengol, M.A.; Fernández, C.; Martín-Rodríguez, C.; Mosteiro, F.; Martínez, D.; Sánchez-Ballesteros, J.; Ferrando, C.; et al. Predicting ICU Mortality in Acute Respiratory Distress Syndrome Patients Using Machine Learning: The Predicting Outcome and STratifiCation of severity in ARDS (POSTCARDS) Study. Crit. Care Med. 2023, 51, 1638–1649. [Google Scholar] [CrossRef] [PubMed]
  29. Knaus, W.A.; Draper, E.A.; Wagner, D.P.; Zimmerman, J.E. APACHE II: A severity of disease classification system. Crit. Care Med. 1985, 13, 818–829. [Google Scholar] [CrossRef] [PubMed]
  30. Vincent, J.L.; de Mendonça, A.; Cantraine, F.; Moreno, R.; Takala, J.; Suter, P.M.; Sprung, C.L.; Colardyn, F.; Blecher, S. Use of the SOFA score to assess the incidence of organ dysfunction/failure in intensive care units: Results of a multicenter, prospective study. Working group on “sepsis-related problems” of the European Society of Intensive Care Medicine. Crit. Care Med. 1998, 26, 1793–1800. [Google Scholar] [CrossRef] [PubMed]
  31. Kim, J.H. Multicollinearity and misleading statistical results. Korean J. Anesthesiol. 2019, 72, 558–569. [Google Scholar] [CrossRef] [PubMed]
  32. Wang, X.; Meng, L.; Zhang, J.; Zhao, Z.; Zou, L.; Jia, Z.; Han, X.; Zhao, L.; Song, M.; Zong, J.; et al. Identification of ferroptosis-related molecular clusters and genes for diabetic osteoporosis based on the machine learning. Front. Endocrinol. 2023, 14, 1189513. [Google Scholar] [CrossRef]
  33. Jolliffe, I.T.; Cadima, J. Principal component analysis: A review and recent developments. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2016, 374, 20150202. [Google Scholar] [CrossRef]
  34. Scrucca, L. GA: A Package for Genetic Algorithms in R. J. Stat. Softw. 2013, 53, 1–37. [Google Scholar] [CrossRef]
  35. Vrieze, S.I. Model selection and psychological theory: A discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). Psychol. Methods 2012, 17, 228–243. [Google Scholar] [CrossRef] [PubMed]
  36. Ioannidis, J.P.A. The Proposal to Lower P Value Thresholds to. 005. JAMA 2018, 319, 1429–1430. [Google Scholar] [CrossRef]
  37. Kim, J.H.; Kwon, Y.S.; Baek, M.S. Machine Learning Models to Predict 30-Day Mortality in Mechanically Ventilated Patients. J. Clin. Med. 2021, 10, 2172. [Google Scholar] [CrossRef]
  38. Rashid, M.; Ramakrishnan, M.; Chandran, V.P.; Nandish, S.; Nair, S.; Shanbhag, V.; Thunga, G. Artificial intelligence in acute respiratory distress syndrome: A systematic review. Artif. Intell. Med. 2022, 131, 102361. [Google Scholar] [CrossRef] [PubMed]
  39. Steyerberg, E.W.; Vergouwe, Y. Towards better clinical prediction models: Seven steps for development and an ABCD for validation. Eur. Hear. J. 2014, 35, 1925–1931. [Google Scholar] [CrossRef]
  40. Van Calster, B.; Nieboer, D.; Vergouwe, Y.; De Cock, B.; Pencina, M.J.; Steyerberg, E.W. A calibration hierarchy for risk models was defined: From utopia to empirical data. J. Clin. Epidemiol. 2016, 74, 167–176. [Google Scholar] [CrossRef]
  41. Schwager, E.; Liu, X.; Nabian, M.; Feng, T.; French, R.M.; Amelung, P.; Atallah, L.; Badawi, O. Machine learning prediction of the total duration of invasive and non-invasive ventilation During ICU Stay. PLoS Digit. Health 2023, 2, e0000289. [Google Scholar] [CrossRef]
  42. Smit, J.M.; Krijthe, J.H.; van Bommel, J.; van Genderen, M.; Labrecque, J.; Komorowski, M.; Gommers, D.; Reinders, M.J.T.; the Causal Inference for ICU Collaborators. The future of artificial intelligence in intensive care: Moving from predictive to actionable AI. Intensive Care Med. 2023, 49, 1114–1116. [Google Scholar] [CrossRef] [PubMed]
  43. Kent, D.M.; van Klaveren, D.; Paulus, J.K.; D’Agostino, R.; Goodman, S.; Hayward, R.; Ioannidis, J.P.; Patrick-Lake, B.; Morton, S.; Pencina, M.; et al. The Predictive Approaches to Treatment effect Heterogeneity (PATH) Statement: Explanation and Elaboration. Ann. Intern. Med. 2020, 172, W1–W25. [Google Scholar] [CrossRef]
  44. Villar, J.; González-Martin, J.M.; Añón, J.M.; Ferrando, C.; Soler, J.A.; Mosteiro, F.; Mora-Ordoñez, J.M.; Ambrós, A.; Fernández, L.; Montiel, R.; et al. Clinical relevance of timing of assessment of ICU mortality in patients with moderate-to-severe Acute Respiratory Distress Syndrome. Sci. Rep. 2023, 13, 1543. [Google Scholar] [CrossRef]
  45. Marti, J.; Hall, P.; Hamilton, P.; Lamb, S.; McCabe, C.; Lall, R.; Darbyshire, J.; Young, D.; Hulme, C. One-year resource utilisation, costs and quality of life in patients with acute respiratory distress syndrome (ARDS): Secondary analysis of a randomised controlled trial. J. Intensive Care 2016, 4, 56. [Google Scholar] [CrossRef] [PubMed]
  46. Pirracchio, R.; Petersen, M.L.; Carone, M.; Rigon, M.R.; Chevret, S.; van der Laan, M.J. Mortality prediction in intensive care units with the Super ICU Learner Algorithm (SICULA): A population-based study. Lancet Respir. Med. 2015, 3, 42–52. [Google Scholar] [CrossRef] [PubMed]
  47. Rose, L.; McGinlay, M.; Amin, R.; Burns, K.E.; Connolly, B.; Hart, N.; Jouvet, P.; Katz, S.; Leasa, D.; Mawdsley, C.; et al. Variation in definition of prolonged mechanical ventilation. Respir. Care 2017, 62, 1324–1332. [Google Scholar] [CrossRef]
  48. Villar, J.; Fernández, C.; González-Martín, J.M.; Ferrando, C.; Añón, J.M.; del Saz-Ortíz, A.M.; Díaz-Lamas, A.; Bueno-González, A.; Fernández, L.; Domínguez-Berrot, A.M.; et al. Respiratory Subsets in Patients with Moderate to Severe Acute Respiratory Distress Syndrome for Early Prediction of Death. J. Clin. Med. 2022, 11, 5724. [Google Scholar] [CrossRef] [PubMed]
  49. Briel, M.; Meade, M.; Mercat, A.; Brower, R.G.; Talmor, D.; Walter, S.D.; Slutsky, A.S.; Pullenayegum, E.; Zhou, Q.; Cook, D.; et al. Higher versus lower positive end-expiratory pressure in patients with acute lung injury and acute respiratory distress syndrome: Systematic review and meta-analysis. JAMA 2010, 303, 865–873. [Google Scholar] [CrossRef]
  50. Shin, K.K.; Rosen, M.J. Is there a safe plateau pressure threshold for patients with acute lung injury and acute respiratory distress syndrome? Am. J. Respir. Crit. Care Med. 2006, 173, 686–687. [Google Scholar]
  51. Kallet, R.H.; Lipnick, M.S.; Zhuo, H.; Pangilinan, L.P.; Gomez, A. Characteristics of non-pulmonary organ dysfunction at onset of ARDS based on the Berlin definition. Respir. Care 2019, 64, 493–501. [Google Scholar] [CrossRef] [PubMed]
  52. Jivraj, N.K.; Hill, A.D.; Shieh, M.-S.; Hua, M.; Gershengorn, H.B.; Ferrando-Vivas, P.; Harrison, D.; Rowan, K.; Lindenauer, P.K.; Wunsch, H. Use of Mechanical Ventilation Across 3 Countries. JAMA Intern. Med. 2023, 183, 824–831. [Google Scholar] [CrossRef]
  53. Neto, A.S.; Tomlinson, G.; Sahetya, S.K.; Ball, L.; Nichol, A.D.; Hodgson, C.; Cavalcanti, A.B.; Briel, M.; Gama de Abreu, M.; Pelosi, P.; et al. Higher PEEP for acute respiratory distress síndrome: A Bayesain meta-analysis of randomized clinical trials. Crit. Care Resusc. 2021, 23, 171–182. [Google Scholar]
  54. Brun-Buisson, C.; Minelli, C.; Bertolini, G.; Brazzi, L.; Pimentel, J.; Lewandowski, K.; Bion, J.; Romand, J.-A.; Villar, J.; Thorsteinsson, A.; et al. Epidemiology and outcome of acute lung injury in European intensive care units. Results from the ALIVE study. Intensive Care Med. 2004, 30, 51–61. [Google Scholar] [CrossRef]
  55. Romano, T.G.; Correia, M.D.T.; Mendes, P.V.; Zampieri, F.G.; Maciel, A.T.; Park, M. Metabolic acid-base adaptation triggered by acute persistent hypercapnia in mechanically ventilated patients with acute respiratory distress syndrome. Rev. Bras. Ter. Intensive 2016, 28, 19–26. [Google Scholar] [CrossRef] [PubMed]
  56. Gendreau, S.; Geri, G.; Pham, T.; Vieillard-Baron, A.; Dessap, A.M. The role of acute hypercapnia on mortality and short-term physiology in patients mechanically ventilated for ARDS: A systematic review and meta-analysis. Intensive Care Med. 2022, 48, 517–534. [Google Scholar] [CrossRef] [PubMed]
  57. Schwager, E.; Jansson, K.; Rahman, A.; Schiffer, S.; Chang, Y.; Boverman, G.; Gross, B.; Xu-Wilson, M.; Boehme, P.; Truebel, H.; et al. Utilizing machine learning to improve clinical trial design for acute respiratory distress syndrome. NPJ Digit. Med. 2021, 4, 133. [Google Scholar] [CrossRef] [PubMed]
  58. Nuckton, T.J.; Alonso, J.A.; Kallet, R.H.; Daniel, B.M.; Pittet, J.-F.; Eisner, M.D.; Matthay, M.A. Pulmonary Dead-Space Fraction as a Risk Factor for Death in the Acute Respiratory Distress Syndrome. N. Engl. J. Med. 2002, 346, 1281–1286. [Google Scholar] [CrossRef]
  59. Pozzi, T.; Fratti, I.; Tomarchio, E.; Bruno, G.; Catozzi, G.; Monte, A.; Chiumello, D.; Coppola, S. Early time-course of respiratory mechanics, mechanical power and gas exchange in ARDS patients. J. Crit. Care 2024, 79, 154444. [Google Scholar] [CrossRef]
  60. Steyerberg, E.W.; Vickers, A.J.; Cook, N.R.; Gerds, T.; Gonen, M.; Obuchowski, N.; Pencina, M.J.; Kattan, M.W. Assessing the performance of prediction models: A framework for some traditional and novel measures. Epidemiology 2010, 21, 128–138. [Google Scholar] [CrossRef]
Figure 1. Flow chart representing the study design of the PIONEER study. The diagram illustrates the scheme for the database with 1173 patients with moderate-to-severe acute respiratory distress syndrome (ARDS), the selection of variables for final analysis, machine learning approaches, and comparison among the prediction models. Once the most relevant variables were selected by genetic algorithms in the dataset of 920 patients, and the model with the best performance (the highest area of the receiver characteristic curve, AUC) was identified, this dataset was divided into five folders to perform 5-fold randomized cross-validation repeated 100 times using machine learning techniques. Abbreviations: AUC: area under the receiver operating characteristic curve; GA: genetic algorithm: LR, logistic regression; MLP, Multilayer Perceptron; RF, Random Forest; SVM, Support Vector Machine; T0, time zero (at the time of diagnosis of moderate/severe ARDS; T24, at 24 h after diagnosis under standardized ventilator settings; and T72, at 72 h after diagnosis.
Figure 1. Flow chart representing the study design of the PIONEER study. The diagram illustrates the scheme for the database with 1173 patients with moderate-to-severe acute respiratory distress syndrome (ARDS), the selection of variables for final analysis, machine learning approaches, and comparison among the prediction models. Once the most relevant variables were selected by genetic algorithms in the dataset of 920 patients, and the model with the best performance (the highest area of the receiver characteristic curve, AUC) was identified, this dataset was divided into five folders to perform 5-fold randomized cross-validation repeated 100 times using machine learning techniques. Abbreviations: AUC: area under the receiver operating characteristic curve; GA: genetic algorithm: LR, logistic regression; MLP, Multilayer Perceptron; RF, Random Forest; SVM, Support Vector Machine; T0, time zero (at the time of diagnosis of moderate/severe ARDS; T24, at 24 h after diagnosis under standardized ventilator settings; and T72, at 72 h after diagnosis.
Jcm 13 01811 g001
Table 1. Descriptive characteristics of 1173 patients with moderate/severe ARDS at the time of diagnosis (T0) at 24 h (T24) and 72 h (T72) after diagnosis of moderate/severe ARDS.
Table 1. Descriptive characteristics of 1173 patients with moderate/severe ARDS at the time of diagnosis (T0) at 24 h (T24) and 72 h (T72) after diagnosis of moderate/severe ARDS.
VariableDevelopment Cohort Testing Cohort
T0
n = 920
T24
n = 920
T72
n = 920
T0
n = 253
T24
n = 253
T72
n = 253
SOFA score8.9 ± 3.38.9 ± 3.57.9 ± 3.99.7 ± 3.59.4 ± 3.98.3 ± 4.4
FiO20.79 ± 0.190.65 ± 0.170.63 ± 0.200.76 ± 0.200.62 ± 0.160.57 ± 0.19
PaO2, mmHg85.9 ± 25.891.8 ± 26.8108.1 ± 35.286.6 ± 25.398.3 ± 30.5107.6 ± 32.9
PaO2/FiO2, mmHg114.4 ± 37.5150.2 ± 57.3196.0 ± 81.7120.8 ± 40.3170.0 ± 65.9221.5 ± 84.6
PaCO2, mmHg49.1 ± 12.546.6 ± 10.148.1 ± 9.550.3 ± 13.747.1 ± 9.947.2 ± 9.2
pH7.31 ± 0.117.35 ± 0.087.40 ± 0.087.30 ± 0.117.35 ± 0.087.40 ± 0.07
VT, ml/kg/PBW6.9 ± 1.06.8 ± 0.96.9 ± 1.16.7 ± 1.16.5 ± 1.16.6 ± 1.2
Respiratory rate21.4 ± 4.923.3 ± 5.024.7 ± 5.222.3 ± 4.423.4 ± 4.824.2 ± 4.9
PEEP, cmH2O12.1 ± 3.312.5 ± 3.011.5 ± 3.311.1 ± 3.011.8 ± 2.810.7 ± 3.0
Pplat, cmH2O26.4 ± 4.926.6 ± 4.624.5 ± 4.725.0 ± 4.924.7 ± 4.522.4 ± 4.4
Driving pressure, cmH2O14.4 ± 4.914.2 ± 4.513.0 ± 4.414.0 ± 4.512.9 ± 4.211.6 ± 3.8
No. extrapulmonary OF1.6 ± 1.11.7 ± 1.11.5 ± 1.21.9 ± 1.11.9 ± 1.21.6 ± 1.3
ARDS: acute respiratory distress syndrome, OF: organ failures, PBW: predicted body weight, PEEP: positive end-expiratory pressure, Pplat: inspiratory plateau pressure, SOFA: sequential organ failure assessment, T0: time of diagnosis of moderate/severe ARDS, T24: at 24 h of moderate/severe ARDS diagnosis, T72: at 72 h of moderate/severe ARDS diagnosis, and VT: tidal volume.
Table 2. Comparison of clinically relevant variables at the baseline (T0) in 920 patients of the development cohort in relation to the duration of mechanical ventilation from the time of diagnosis of moderate/severe ARDS.
Table 2. Comparison of clinically relevant variables at the baseline (T0) in 920 patients of the development cohort in relation to the duration of mechanical ventilation from the time of diagnosis of moderate/severe ARDS.
VariableMV 3–14 Days
n = 479
MV > 14 Days
n = 441
Mean Difference
(95%CI)
p
Age, yr, mean ± SD56.0 ± 16.857.1 ± 15.01.1 (−1 to 3.2)0.292 *
Sex, n (%) 0.050
Male311 (64.9)313 (71.0)6.1 (0.1 to 12.0)
Female168 (35.1)128 (29.0)6.1 (0.1 to 12.0)
Etiology, n (%) 0.124
Pneumonia226 (47.2)232 (52.6)5.4 (−1.6 to 11.8)
Sepsis142 (29.6)104 (23.6)6.0 (0.3 to 11.6)
Aspiration51 (10.6)39 (8.8)1.8 (−2.1 to 5.6)
Trauma37 (7.7)35 (7.9)0.2 (−3.3 to 3.8)
Others23 (4.8)31 (7.0)2.2 (−0.9 to 5.4)
Degree of severity, n (%) 0.195
Severe183 (38.2)187 (42.4)4.2 (−2.1 to 10.5)
Moderate296 (61.8)254 (57.6)4.2 (−2.1 to 10.5)
APACHE II score20.2 ± 6.620.6 ± 6.10.4 (−0.4 to 1.2) 0.352 *
SOFA score8.7 ± 3.59.1 ± 3.10.4 (−0.01 to 0.8)0.055 *
FiO20.79 ± 0.190.79 ± 0.190 (0 to 0)1.0 *
PaO2, mmHg87.8 ± 26.883.7 ± 24.5−4.1 (−7.5 to −0.8)0.016 *
PaO2/FiO2, mmHg117.3 ± 38.4111.4 ± 36.3−5.9 (−10.7 to −1.1)0.017 *
PaCO2, mmHg48.4 ± 12.549.8 ± 12.41.4 (−0.1 to 3.1)0.074 *
pH7.31 ± 0.117.31 ± 0.100.0 (−0.01 to 0.01)1.0 *
VT, ml/Kg PBW6.9 ± 1.06.9 ± 1.00.0 (−0.1 to 0.1)0.770 *
Respiratory rate, cycles/min21.1 ± 4.921.7 ± 4.90.6 (−0.1 to 1.2)0.093 *
PEEP, cmH2O11.8 ± 3.312.3 ± 3.40.5 (0.03 to 0.9)0.036 *
Plateau pressure, cmH2O26.2 ± 5.026.6 ± 4.70.4 (−0.1 to 1.1)0.119 *
Driving pressure, cmH2O14.4 ± 4.914.4 ± 4.90 (−1.0 to 1.0)0.852 *
No. extrapulmonary OF1.5 ± 1.11.7 ± 1.00.2 (0.01 to 0.29)0.030 *
All-cause ICU mortality, n (%)155 (32.4)152 (34.5)2.1 (−4.0 to 8.2)0.500
APACHE: acute physiology and chronic health evaluation, ARDS: acute respiratory distress syndrome, MV: mechanical ventilation, OF: organ failures, PBW: predicted body weight, PEEP: positive end-expiratory pressure, SD: standard deviation, SOFA: sequential organ failure assessment, VT: tidal volume. (*) Student’s t-test; and () Fisher’s exact test.
Table 3. Comparison of analysis of performance using Random Forest, Support Vector Machine, Multilayer Perceptron, and logistic regression of the optimum model at T0, T24, and T72 in 920 patients with moderate/severe ARDS.
Table 3. Comparison of analysis of performance using Random Forest, Support Vector Machine, Multilayer Perceptron, and logistic regression of the optimum model at T0, T24, and T72 in 920 patients with moderate/severe ARDS.
TimeMethodsModelAUC (95%CI)SensitivitySpecificityAccuracyPPVNPV
T0
(minimizing AIC)
Multilayer Perceptron9-variable0.66 (0.60–0.72)0.600.630.620.600.63
Random Forest9-variable0.54 (0.47–0.60)0.460.600.530.510.55
Support Vector Machine9-variable0.51 (0.43–0.59)0.370.660.520.500.53
Logistic regression9-variable0.58 (0.52–0.64)0.630.550.590.580.64
T24
(minimizing AIC)
Multilayer Perceptron6-variable0.63 (0.56–0.69)0.760.440.590.560.67
Random Forest6-variable0.54 (0.47–0.61)0.470.590.530.510.55
Support Vector Machine6-variable0.51 (0.43–0.59)0.400.640.520.500.54
Logistic regression6-variable0.58 (0.52–0.64)0.660.520.590.570.64
T72 (minimizing AIC)Multilayer Perceptron11-variable0.71 (0.65–0.76)0.530.740.640.650.63
Random Forest11-variable0.61 (0.55–0.68)0.460.680.570.570.58
Support Vector Machine11-variable0.55 (0.40–0.65)0.240.840.550.600.55
Logistic regression11-variable0.63 (0.57–0.69)0.640.610.620.610.66
T72 (minimizing BIC)Multilayer Perceptron4-variable0.63 (0.57–0.70)0.490.690.600.600.60
Random Forest4-variable0.58 (0.50–0.65)0.480.620.550.540.57
Support Vector Machine4-variable0.53 (0.41–0.63)0.500.580.540.520.56
Logistic regression4-variable0.62 (0.56–0.68)0.650.580.610.600.66
AIC: Akaike information criterion, AUC: area under the receiver operating characteristic curve, BIC: Bayesian information criterion, CI: confidence intervals, NPV: negative predictive value, and PPV: positive predictive value.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Villar, J.; González-Martín, J.M.; Fernández, C.; Soler, J.A.; Ambrós, A.; Pita-García, L.; Fernández, L.; Ferrando, C.; Arocas, B.; González-Vaquero, M.; et al. Predicting the Length of Mechanical Ventilation in Acute Respiratory Disease Syndrome Using Machine Learning: The PIONEER Study. J. Clin. Med. 2024, 13, 1811. https://doi.org/10.3390/jcm13061811

AMA Style

Villar J, González-Martín JM, Fernández C, Soler JA, Ambrós A, Pita-García L, Fernández L, Ferrando C, Arocas B, González-Vaquero M, et al. Predicting the Length of Mechanical Ventilation in Acute Respiratory Disease Syndrome Using Machine Learning: The PIONEER Study. Journal of Clinical Medicine. 2024; 13(6):1811. https://doi.org/10.3390/jcm13061811

Chicago/Turabian Style

Villar, Jesús, Jesús M. González-Martín, Cristina Fernández, Juan A. Soler, Alfonso Ambrós, Lidia Pita-García, Lorena Fernández, Carlos Ferrando, Blanca Arocas, Myriam González-Vaquero, and et al. 2024. "Predicting the Length of Mechanical Ventilation in Acute Respiratory Disease Syndrome Using Machine Learning: The PIONEER Study" Journal of Clinical Medicine 13, no. 6: 1811. https://doi.org/10.3390/jcm13061811

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop