Next Article in Journal
Behind the Screens: A Systematic Literature Review on Barriers and Mitigating Strategies for Combating Cyberbullying
Next Article in Special Issue
Applications of Generative Artificial Intelligence in Electronic Medical Records: A Scoping Review
Previous Article in Journal
Exploring the Potential of BCI in Education: An Experiment in Musical Training
Previous Article in Special Issue
Prescribing the Future: The Role of Artificial Intelligence in Pharmacy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Artificial Intelligence in Bronchopulmonary Dysplasia: A Review of the Literature

1
Long School of Medicine, University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
2
College of Nursing, University of Florida Health, Gainesville, FL 32610, USA
*
Author to whom correspondence should be addressed.
Information 2025, 16(4), 262; https://doi.org/10.3390/info16040262
Submission received: 31 December 2024 / Revised: 13 March 2025 / Accepted: 14 March 2025 / Published: 24 March 2025
(This article belongs to the Special Issue Artificial Intelligence and Data Science for Health)

Abstract

:
Bronchopulmonary dysplasia (BPD) is a neonatal lung condition predominantly affecting preterm infants. Researchers have turned to computational tools, such as artificial intelligence (AI) and machine learning (ML), to better understand, diagnose, and manage BPD in patients. This study aims to provide a comprehensive summary of current AI applications in BPD risk stratification, treatment, and management and seeks to guide future research towards developing practical and effective computational tools in neonatal care. This review highlights breakthroughs in predictive modeling using clinical-, genetic-, biomarker-, and imaging-based markers. AI has helped advance BPD management strategies by optimizing treatment pathways and prognostic predictions through computational modeling. While these developments become increasingly clinically applicable, numerous challenges remain in data standardization, external validation, and the equitable integration of AI solutions into clinical practice. Addressing ethical considerations, such as data privacy and demographic representation, as well as other practical considerations will be essential to ensure the proper implementation of AI clinical tools. Future research should focus on prospective, multicenter studies, leveraging multimodal data integration to enhance early diagnosis, personalized interventions, and long-term outcomes for neonates at risk of BPD.

1. Background

1.1. Bronchopulmonary Dysplasia

Bronchopulmonary dysplasia (BPD) was first described by Northway et al., 1967 as a lung disease in preterm infants who initially had hyaline membrane disease, now commonly known as respiratory distress syndrome (RDS) [1]. Over the subsequent decades, the clinical manifestation and understanding of BPD have undergone substantial transformation, paralleling advances in neonatal intensive care and respiratory therapeutic interventions.
BPD constitutes a complex clinical syndrome of postnatal lung injury characterized by disrupted alveolarization and microvascular development [2]. Pathological examinations reveal distinct anatomical alterations, including regions of cystic emphysema, fibrotic changes, diminished alveolar formation, and structural airway modifications [2,3,4]. These morphological changes result in profound physiological consequences, such as impaired gas exchange, compromised pulmonary mechanics, and potential long-term sequelae, such as persistent hypoxia, reduced exercise tolerance, and increased susceptibility to chronic respiratory conditions [2].
Epidemiological investigations consistently demonstrate prematurity and low birth weight as the most significant predictive risk factors. According to Younge et al., 2021, approximately 80% of infants born at 22–24 weeks of gestation develop BPD, contrasting with a 20% incidence among infants born at 28 weeks [5]. Additional risk factors are outlined in Table 1 [2,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22]. Emerging research suggests a significant genetic component, with ongoing investigations exploring potential genetic markers and molecular mechanisms underlying BPD pathogenesis [20,21,22].
The pathophysiological mechanism of BPD is multifactorial. Postnatal respiratory function critically depends on the interface between pulmonary alveolar and endothelial cells [2]. A deficiency of pulmonary surfactant after birth is a primary cause of RDS in preterm infants [23]. However, therapeutic modalities, such as mechanical ventilation, continuous positive airway pressure, and supplemental oxygen, while essential for initial stabilization, simultaneously introduce potential mechanisms of pulmonary inflammation and pulmonary developmental arrest [24].
The diagnostic criteria of BPD have changed over the years. The previous definition was based on the persistent need for supplemental oxygen at a postnatal age of 28 days or a postmenstrual age (PMA) of 36 weeks; newer definitions are more nuanced [1]. In 2018, the National Institute of Child Health and Human Development (NICHD) revised the definition of BPD by eliminating the requirement for 28 days of oxygen therapy before 36 weeks PMA, adding a requirement for the radiographic confirmation of parenchymal lung disease and using a severity grading based on supplemental oxygen to incorporate newer modes of non-invasive ventilation [14]. In 2019, Jensen et al. proposed modifications to the NICHD definition. Their scale defined the stages of BPD, at 36 weeks PMA, as follows: (1) no BPD (no support required), (2) grade 1 (nasal cannula ≤ 2 L/min), (3) grade 2 (nasal cannula > 2 L/min or non-invasive positive airway pressure), and (4) grade 3 (invasive mechanical ventilation) [7].
Contemporary clinical interventions focus on minimizing pulmonary trauma and optimizing respiratory outcomes. The Caffeine for Apnea of Prematurity (CAP) trial by Schmidt et al. demonstrated significant reductions in ventilator dependency [25]. At the same time, targeted oxygen saturation management (91–95%) has shown promise in mitigating oxidative lung damage. Emerging therapeutic strategies, including less invasive surfactant administration and standardized glucocorticoid protocols, represent additional advances in BPD management [26,27,28].
Despite advances in early respiratory care, some preterm infants develop chronic respiratory disease and related comorbidities that persist after discharge [2,29]. These patients may require additional interventions, such as continued supplemental oxygen therapy, respiratory medications, or in severe cases, tracheostomy placement along with home mechanical ventilation. Approximately 50% of patients experience rehospitalization within their initial two years, with heightened vulnerability to respiratory viral infections [30,31]. Long-term sequelae include increased risks of reactive airway disease, potential obstructive or restrictive lung pathologies, and associated developmental complications as a result of chronic hypoxia [32,33,34,35]. The trajectory of BPD research underscores the complexity of premature infant pulmonary development. Continued interdisciplinary investigation remains critical to elucidating comprehensive prevention and management strategies for this challenging clinical syndrome. New approaches are needed to better optimize early BPD diagnosis and the management of patient care, and advancements in artificial intelligence offer promising tools to address these challenges.

1.2. Artificial Intelligence and Applications in Healthcare

Artificial intelligence (AI) is the simulation of human intelligence by machines to perform tasks typically requiring human cognitive functions, such as problem solving, learning, and decision making [36]. AI systems utilize algorithms and computational models to interpret data and execute actions autonomously or semi-autonomously. AI technology holds promise across various sectors, including healthcare, due to its ability to manage large datasets, provide real-time analysis, and improve efficiency and accuracy in decision making [36]. The field of AI consists of a vast number of subfields, each with specific approaches, methodologies, and use cases, as outlined in Table 2 [37,38,39].
AI involvement in healthcare dates back to the 1970s, with early attempts focusing on rule-based systems. Programs such as MYCIN, developed at Stanford University, aimed to assist physicians in diagnosing infectious diseases by offering antibiotic recommendations [40]. The field gained momentum in the 2000s with the advent of more advanced computational tools, larger datasets, such as electronic health records (EHRs), and breakthroughs in machine learning algorithms [41]. Advances in deep learning (DL) further propelled AI’s role in healthcare, particularly in medical imaging. For instance, DL algorithms have been shown to perform at or above the level of human experts in identifying conditions such as pneumonia in radiographic images [42]. Today, AI is used in various applications, such as medical imaging, predictive analytics, clinical decision support systems, and personalized medicine [41,42,43]. Figure 1 highlights the conventional research workflow to developing, optimizing, and deploying a predictive ML model.
The rapid expansion of the AI field has also made its way into neonatology and has impacted patient care. Numerous neonatology researchers in recent years have employed ML technology to assist in the risk stratification, early diagnosis, and treatment management of BPD patients. These studies have helped researchers construct a deeper understanding of the disease mechanism of BPD and have assisted in the proper management of patient care. This review aims to synthesize existing models to evaluate the role of AI in addressing the challenges of BPD patient care and management. Specifically, this study summarizes the existing applications of AI in stratifying BPD risk, predicting BPD diagnosis, and managing BPD treatment and identifies areas for further research in order to develop more practical and effective computational tools for neonatal care.

1.3. Methods

To systematically assess the role of AI in predicting BPD, we conducted a review of the literature. A search was performed across the following databases—PubMed, CINAHL, Research Rabbit, Web of Science, and Scopus—to identify relevant studies published between 1 January 2021 and 15 March 2024. To ensure the inclusion of the most recent advancements in AI-driven BPD prediction, we focused on studies published from 2021 onward. This time frame captures the rapid evolution of deep learning techniques and external validation strategies that were not well-established in earlier studies. The following Boolean search terms were used: (“Bronchopulmonary Dysplasia” OR “BPD”) AND (“Artificial Intelligence” OR “Machine Learning” OR “Neural Networks”) AND (“Premature” OR “Neonate” OR “Newborn”).
Studies were included if they (1) developed or validated an AI/ML model for predicting BPD in neonates, (2) utilized clinical, laboratory, or imaging-based features, and (3) reported performance metrics, such as AUC, sensitivity, specificity, or predictive accuracy. Studies were excluded if they were animal studies, in vitro experiments, case reports, or conference abstracts. Data extraction focused on study characteristics, AI model types, dataset sources, validation strategies, and performance metrics. Heterogeneity in study design, BPD definitions (2001 NICHD, 2018 NICHD, Jensen criteria), and dataset size were also analyzed.

2. Artificial Intelligence in Analyzing BPD Risk

New approaches using ML have emerged as powerful tools for identifying predisposing risk factors and developing predictive models for BPD. While traditional statistical methods have long been employed in clinical research, recent studies demonstrate the potential of ML techniques to uncover nuanced predictive markers and stratify risk more precisely and at an earlier age. For additional reference, the applications of various AI algorithms in neonatology are outlined in Table 3 [44].
Studies included datasets from seven countries, with Korea contributing the largest proportion (49.3%). While this diversity enhances generalizability, the predominance of data from high-resource settings may limit applicability to low-resource neonatal units. Logistic regression was frequently used for its interpretability, while random forest and XGBoost models were commonly used due to their robustness in handling nonlinear relationships and missing data. SVM was favored for small datasets, while deep learning models, like CNNs, were less prevalent due to their high data requirements. Deep learning models, such as CNNs, were used in imaging-based approaches due to their high accuracy in pattern recognition.
Lei et al., 2021 initiated this approach by constructing random forest (RF) models to evaluate feature selection’s impact on predictive accuracy [45]. Their study analyzed clinical data from 648 preterm infants and compared two ML models with differing feature sets to optimize performance, while limiting data requirements. They began by using the Boruta algorithm to select for clinical features. This algorithm duplicates and randomizes feature data, generates an RF decision tree-based classifier to assess feature importance, and marks important clinical features. This process is iterated multiple times to ensure stable results and returns the 12 most important features. The researchers fed these features into an RF model for evaluation and further isolated the six most important features to create a separate RF model for comparative analysis. Despite using fewer clinical variables (6 vs. 12), the 6-feature model maintained comparable accuracy (0.8958 vs. 0.9167) and showed a marginal improvement in area under the curve (AUC) from 0.922 to 0.929. However, the study was limited by its single-center design and geographical specificity to the Liuzhou Hospital in Western China, potentially restricting broader generalizability. The single-center design potentially limits generalizability as the predictive model and features identified may be due to bias within the patient population.
Building upon clinical variable approaches, Moreira et al., 2023 explored transcriptomic signatures using multiple machine learning algorithms [46]. They began by collecting microarray data for 97 very low birth weight (VLBW) infants on day 5 of life and filtered relevant genes through expression levels and statistical methods to isolate 4523 genes. By employing k-nearest neighbors (kNNs), support vector machines (SVMs), neural networks (NN)s, and extreme gradient boosting (XGBoost), they identified 14 potential gene signatures, ultimately focusing on five most predictive genes: PNPO, MSANTD2, CD4, SNX1, and P2RX7. These genes suggest a dysregulation of T cell development and function in BPD. Their XGBoost model demonstrated exceptional predictive capacity, achieving an AUC of 0.961. Remarkably, the model’s performance in diagnosing severe BPD reached an impressive AUC of 0.998. This study, however, was limited by its use of single-center data points and microarray data that are not always readily available in the clinic at birth. Additionally, the study’s performance could not be compared to existing BPD outcome estimators due to the lack of clinically relevant data points, such as respiratory support.
Montagna et al., 2024 provided a critical analysis between conventional statistical methods and ML approaches [47]. Traditional statistical analysis highlighted prenatal variables, like absent or reversed end-diastolic flow velocity (AREDV), ductus venosus flow alterations, and chorioamnionitis, as key predictors. In contrast, their ML models, particularly the XGBoost classifier, identified a slightly different set of most relevant variables, as follows: extremely low birth weight, gestational age, AREDF status, magnesium sulfate prophylaxis, and mechanical ventilation status. The convergence of both methodologies on certain predictors—extremely low birth weight, gestational age, mechanical ventilation, and AREDF status—strengthens the reliability of these risk factors. The additional risk factors identified through ML methods were considered less reliable due to the poor precision of the algorithms.
Using a different type of ML, Moreira et al., 2023 expanded their investigation on BPD by using unsupervised clustering techniques [48]. They collected microarray data and identified differentially expressed genes based on “BPD subgroups”. They separated BPD cases into four distinct endotypes (A, B, C, and D) based on 7319 genes. By creating clusters or groupings of unlabeled datapoints, they identified a set of 1207 genes that overlapped across all clusters. Afterwards, supervised learning identified 20 gene markers that could predict the different endotypes, BPD clusters based on genetic data. Despite these promising findings, the clinical translation of such complex genetic stratification remains challenging and requires further investigation. Their model also focuses on retrospective data analysis, and no current studies evaluate prospective BPD risk through endotyping.
These studies collectively demonstrate the potential of ML in BPD research. Advanced computational techniques allow researchers to identify more precise and personalized risk prediction models. However, significant challenges remain, including the need for larger, more diverse datasets, comprehensive validation across different clinical settings, and the development of clinically implementable predictive tools.

3. Artificial Intelligence in BPD Diagnosis

Emerging AI technology has also aided in the early diagnosis of BPD. Using patient data available at birth or during pre-birth screenings, researchers can now leverage computational approaches to develop models that explore clinical, genetic, and imaging-based markers to accurately predict BPD diagnosis.
In one of the earlier BPD predictive models, Jassem-Bobowicz et al., 2021 developed a multivariate risk assessment approach using 278 preterm infants [49]. Their model focused on clinical variables, including gestational age, red blood cell transfusions, surfactant administration, and patent ductus arteriosus. Although the researchers did not leverage AI technology, they achieved a high AUC of 0.932, demonstrating the potential of carefully selected predictive variables in BPD risk assessment. Their study created a risk-scoring system for BPD and maintained a high AUC, while being simple and feasible in clinical practice. Their predictive model is, however, limited by their low sample size of patient data in the different subcategories split by clinical variables.
Expanding on clinical data integration, Verder et al., 2021 introduced an innovative approach by combining clinical information with gastric aspirate sample analysis through Fourier transform infrared (FTIR) spectral analysis [50]. FTIR spectral analysis is an analytic laboratory technique that characterizes substances based on their interactions with infrared radiation. The researchers fed FTIR data alongside clinical data available at birth into an SVM model to classify each patient based on BPD diagnosis. The SVM model integrates clinical and FTIR data to position each patient within a multidimensional feature space and generates a hyperplane to differentiate between patients with and without BPD. This hyperplane facilitates the classification of patients and enables the prediction of BPD diagnoses. The resulting model achieved remarkable diagnostic performance, with 88% sensitivity and 91% specificity, offering a near-immediate predictive capability immediately after birth.
Genetic factor integration represented another significant advancement in BPD prediction. Dai et al., 2021 demonstrated the substantial predictive power of incorporating genetic risk factors into ML models [51]. Analyzing a dataset of 245 premature infants, they observed substantial improvements in model performance. They first created a risk-scoring system to identify genetic markers that are more strongly associated with both mild and severe BPD diagnoses. They identified 30 risk genes for mild BPD and 21 genes for severe BPD, including OBSL1, GNAS, TCIRG1, and C5 among others. Most of these genes are primarily involved in susceptibility to infection, immune regulation, and cellular biological function. They also collected clinical information, such as birth history, morbidities in the first week of life, early treatment after birth, morbidities during hospitalization, and treatment. The AUC increased from 0.826 to 0.907 for severe BPD prediction and from 0.814 to 0.915 for mild-to-moderate BPD classification, when genetic factors were integrated, highlighting the potential of comprehensive data approaches. While promising, their study has yet to be validated on an external dataset of patients, so it still has limited predictive value.
Biomarker-based prediction emerged as another promising avenue. Gao et al., 2023 investigated umbilical cord blood interleukin-6 (UCB IL-6) as a predictive marker across 414 preterm infants categorized by BPD severity [52]. Comparing four ML algorithms—XGBoost, CatBoost, LightGBM, and RF—they consistently identified UCB IL-6 as the most prominent feature. The XGBoost, CatBoost, and LightGBM models are all decision tree-based algorithms that use boosting. These models are trained iteratively in which successive decision trees are constructed to progressively enhance the model’s performance, with the final classification determined by an ensemble of trained decision trees. In contrast, the RF algorithm employs a bagging technique, combining the predictions of multiple decision trees to generate the classification outcome. All of the models achieved impressive AUCs ranging from 0.841 to 0.870, with CatBoost performing optimally. The study established a strong association between elevated UCB IL-6 levels and advanced BPD grades, highlighting the potential of specific biomarkers in risk stratification. Similar to the previous studies, their study was limited by a small sample size, which disrupted the model’s ability to predict the grade of BPD severity.
Deep learning models, particularly U-Net and ResNet, have demonstrated promising results in neonatal chest radiograph analysis, offering improved accuracy and reduced observer variability compared to traditional manual interpretation. U-Net excels in segmenting lung regions by extracting and highlighting key image features, while ResNet, a pre-trained convolutional neural network (CNN), is commonly employed for classification tasks due to its ability to efficiently process large images and optimize training by skipping redundant layers. Chou et al. (2024) introduced a novel AI-driven BPD prediction model that leverages U-Net for lung segmentation and ResNet for classification, using radiographs taken within 24 h of birth from preterm and term infants at the National Cheng Kung University Hospital [53]. Their model achieved an average F1 score of 0.80, consistently outperforming human prediction in BPD diagnosis. Despite these promising results, the study was constrained by its retrospective, single-center design, limited dataset, and a lack of external validation. Furthermore, neonatal lung image analysis presents inherent challenges, including variability in radiograph quality and anatomical differences in preterm infants. These findings emphasize the need for multicenter collaborations and adaptive machine learning approaches to enhance model generalizability and clinical applicability. While AI models often outperform clinical scoring systems and logistic regression in predictive accuracy, the black-box nature of deep learning remains a significant hurdle. Balancing AI’s predictive power with clinician trust through explainability techniques will be crucial for successful real-world implementation.
These studies collectively illustrate the capability of AI in BPD diagnostics. The models are summarized in Table 4 [50,51,52,53]. Through the integration of diverse data sources—clinical variables, genetic markers, biomarkers, and imaging data—researchers are developing increasingly precise predictive models. Areas of future research include large-scale validation, the standardization of methodologies, and the translation of these computational approaches into clinical practice.

4. Artificial Intelligence in BPD Management and Treatment

Modern computational ML approaches are also being leveraged to address critical challenges in BPD management, offering unprecedented insights into patient outcomes, disease severity prediction, and treatment optimization strategies.
Leigh et al., 2022 conducted a comprehensive computational analysis of 689 preterm infants, employing ensemble modeling techniques to elucidate BPD-free survival predictors [54]. Their statistical modeling achieved a robust AUC of 0.899, systematically demonstrating that reduced gestational age and prolonged intubation durations significantly correlated with diminished survival probabilities. The research identified continuous positive airway pressure post-extubation as a superior intervention compared to non-invasive mechanical ventilation, offering evidence-based guidance through AI tools for BPD respiratory support management in clinical practice.
Extending computational methodologies to radiographic analysis, Xing et al., 2022 implemented advanced DL architectures for early BPD severity [55]. Utilizing transfer learning and sophisticated NN segmentation, they developed a predictive model capable of determining BPD severity at 28 days—substantially earlier than traditional 56-day clinical assessments. Through a comprehensive evaluation of six NN architectures, the VGG-16 network demonstrated an exceptional diagnostic accuracy of 95.58%, substantiating the potential of advanced computational imaging techniques in neonatal respiratory diagnostics. Similar to the ResNet model, VGG-16 is a pre-trained CNN that is known for its simplicity and depth and its ability to process and classify images. This study highlights the application of DL approaches in the predictive abilities of BPD severity.
Wu et al., 2023 developed a comprehensive prognostic risk stratification model utilizing the Taiwan Neonatal Network’s (TNN’s) extensive dataset of 3200 infants [56]. They collected 24 early-life clinical characteristics for features for the model and graded the severity of the initial neonatal resuscitation. The researchers then systematically evaluated seven ML algorithms and constructed a predictive framework with logistic regression, exhibiting the most statistically robust performance. The model’s AUC values ranged from 0.763 to 0.881 across various outcome categories, with important predictive features including birth weight, gestational age, initial resuscitation intubation characteristics, early sepsis manifestations, and surfactant administration. This study is, however, limited by the geographical specificity of the TNN database and potential collinearity of clinical variables that might serve as a confounding variable when identifying the critical predictive features of late respiratory support.
Addressing practical clinical challenges, Tao et al., 2024 developed a targeted ML algorithm for predicting extubation failure in BPD-diagnosed neonates [57]. They first separated their dataset based on the outcome of extubation (success or failure) and leveraged conventional statistical methods to compare clinical data points. Analyzing 284 mechanically ventilated infants, the researchers employed six computational approaches, with XGBoost demonstrating superior predictive capabilities, achieving an AUC of 0.873. The study identified partial oxygen pressure, hemoglobin levels, and mechanical ventilation status as the most significant parameters. This study is limited due to conflicting definitions of extubation failure and also must be validated through a multicenter prospective study to assess generalizability.
These computational investigations collectively demonstrate the power of ML methodologies in BPD management. Researchers are integrating complex multidimensional clinical, physiological, and imaging datasets and are developing increasingly sophisticated predictive models that promise to revolutionize neonatal respiratory care strategies. However, there are still numerous obstacles to fully implementing AI in clinical treatment.

Multimodal Data Integration in BPD Prediction

BPD is a multifactorial disease influenced by clinical, imaging, genetic, and physiological factors, making multimodal data integration a critical approach for improving predictive accuracy. Traditional ML models often rely on single-source data, such as EHRs or imaging, limiting their predictive capability. Recent advancements in AI and deep learning have enabled models to integrate multiple data types, improving their robustness and applicability in neonatal care. Clinical AI models incorporating gestational age, birth weight, ventilatory support settings, and laboratory markers have demonstrated superior predictive power compared to conventional logistic regression models [7,14]. For instance, deep learning-based radiomics has enabled the extraction of quantitative lung imaging biomarkers from neonatal chest radiographs, facilitating early disease detection [4,50]. A CNN model combining chest X-ray features with clinical data improved BPD risk stratification, outperforming models trained on a single modality [53,55]. Beyond imaging, genetic and transcriptomic data are emerging as important components of BPD prediction. Genome-wide association studies (GWAS) have identified genetic loci related to lung development and inflammatory responses, such as VEGF, IL-6, and ANGPT2, which are implicated in BPD pathogenesis [12,13]. Multiomics approaches integrating RNA sequencing and proteomics data with clinical risk factors have demonstrated potential in identifying high-risk neonates [46,51]. Additionally, continuous physiological monitoring, including oxygen saturation trends, heart rate variability, and ventilatory patterns, has been proposed as a valuable predictive tool, allowing AI-driven models to assess BPD risk dynamically [11,56].
Despite its promise, multimodal AI integration presents several challenges, including data heterogeneity, computational costs, and the need for standardized validation frameworks. Differences in EHR formats, imaging acquisition protocols, and genomic sequencing methods can hinder model generalizability [23,49]. To address these issues, federated learning techniques have been introduced, allowing AI models to be trained across multiple centers, while preserving patient privacy and compliance with HIPAA and GDPR regulations [6,58]. Furthermore, explainability in AI models remains a significant barrier to clinical adoption. Advanced interpretability techniques, such as SHAP (Shapley additive explanations) and attention-based neural networks, are being explored to improve transparency and clinician trust in AI-generated predictions [47,59]. Future research should focus on the prospective validation of multimodal AI models through large-scale, multicenter clinical trials to ensure regulatory approval and real-world applicability. By integrating clinical, imaging, genetic, and physiological data, AI-driven models can provide more accurate and individualized risk assessments, ultimately advancing early BPD prediction and neonatal care.

5. Discussion

5.1. AI vs. Traditional Statistical Methods and Research Recommendations

AI-based models have demonstrated higher predictive performance compared to traditional statistical approaches for BPD risk assessment. In multiple studies, machine learning models, such as random forest, XGBoost, and SVM achieved higher AUC values (ranging from 0.80 to 0.97) than conventional logistic regression models, which often plateau around 0.70 to 0.85. Additionally, deep learning architectures, like CNNs, have shown success in processing imaging data to predict BPD severity, an area where traditional regression-based methods fall short due to their inability to capture spatial features.
Despite these advantages, logistic regression remains widely used in clinical settings due to its interpretability and transparency. Unlike black-box AI models, logistic regression provides clear coefficients and direct associations between input variables and outcomes, making it easier for clinicians to understand and validate predictions. Moreover, logistic regression models require fewer computational resources and are less susceptible to overfitting, especially when dealing with smaller datasets or limited features.
To facilitate real-world AI adoption, several key challenges must be addressed:
  • External validation and generalizability: Many AI models for BPD prediction are developed using single-center datasets, limiting their applicability across diverse patient populations. Future research should prioritize multicenter collaborations, where models are trained and tested on heterogeneous datasets from different NICUs to enhance robustness.
  • Federated learning for data privacy: Due to strict data-sharing regulations (HIPAA, GDPR), hospitals are often unable to share patient data for AI model training. Federated learning (FL) provides a potential solution by enabling decentralized model training across multiple institutions without transferring sensitive patient data. This approach can improve model accuracy, while maintaining data security and compliance with privacy laws.
  • Standardization of AI model reporting: A major barrier to AI integration in neonatal care is the lack of uniform reporting standards for model performance metrics. While AUC, sensitivity, and specificity are commonly reported, calibration plots, confidence intervals, and decision thresholds are often omitted, making it difficult to compare models. Future research should follow established guidelines, such as TRIPOD (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis) to ensure reproducibility and facilitate regulatory approval.
  • Improving model explainability: One of the major limitations of deep learning models in BPD prediction is their lack of transparency. Implementing post hoc interpretability methods, such as SHAP (Shapley additive explanations) and LIME (local interpretable model-agnostic explanations) can help clinicians understand which features contribute most to AI predictions. For example, if an AI model predicts a high risk of BPD, SHAP can highlight whether gestational age, oxygen therapy duration, or mechanical ventilation settings were the primary contributing factors.
  • AI Integration into clinical workflows: Even the most accurate AI models hold little clinical value if they are not integrated into real-world NICU workflows. Future research should focus on developing user-friendly AI-based decision support tools that seamlessly integrate with EHRs, providing real-time risk predictions at the bedside.
By addressing these challenges, AI-driven BPD prediction models can transition from research settings to clinical practice, ultimately improving early detection and long-term outcomes for preterm infants.

5.2. Ethical and Practical Considerations

AI holds the potential to transform healthcare, particularly in the management of complex conditions, like BPD. However, the adoption of AI in NICUs raises several ethical and practical considerations that must be addressed to ensure safe, effective, and equitable care. Although AI models have demonstrated remarkable predictive power, their clinical translation remains hindered by concerns regarding data privacy, fairness, and model bias. Many AI algorithms are trained on datasets that may not be representative of diverse patient populations, leading to disparities in model performance. Training datasets must be diverse and representative of the populations affected by BPD to avoid bias. This includes data from infants of different racial and ethnic backgrounds, as well as those from varying socioeconomic and geographic contexts. Ensuring demographic diversity in the data can improve the generalizability of AI models and reduce the risk of biased or unequal treatment.
Data quality, privacy, and interoperability are crucial in ensuring the quality of AI models, which are limited by the datasets they are trained on. Data quality can vary significantly between institutions, resulting in incomplete data or inconsistent datasets. Missing data, signal noise, or variations in clinical practice can lead to poor model performance or inaccurate predictions. Healthcare institutions often use different electronic health record (EHR) systems, medical devices, and data formats, which creates challenges in integrating AI solutions. Data must be interoperable, so data can flow between departments and devices. Patient data are sensitive, and patient privacy must be conserved, as regulated by the Health Insurance Portability and Accountability Act (HIPAA) in the U.S. and similar regulations worldwide [59]. AI algorithms must meet stringent privacy standards to prevent unauthorized use or misuse of health information. This includes anonymizing datasets to protect patient identities, ensuring secure data storage, and regulating access to AI-generated insights. Sharing data between institutions or integrating third-party AI tools requires secure data-sharing protocols to prevent security breaches and navigate complex legal and regulatory frameworks [58].
Equally important are the ethical issues surrounding bias, demographic representation, and the equitable delivery of care, which impact how AI can be used to serve diverse patient populations. AI systems can inadvertently perpetuate or even exacerbate biases present in the training data [57]. For instance, if an AI model used to predict BPD risk is trained primarily on data from NICUs in high-resource settings, it may not generalize well to infants born in lower-resource settings or those from underrepresented racial or ethnic groups. This bias can lead to disparities in care, with AI potentially misclassifying or underestimating risks for certain populations. Training datasets must be diverse and representative of the populations affected by BPD to avoid bias. This includes data from infants of different racial and ethnic backgrounds, as well as those from varying socioeconomic and geographic contexts. Ensuring demographic diversity in the data can improve the generalizability of AI models and reduce the risk of biased or unequal treatment. Ensuring equitable access to AI-driven technologies is a key ethical consideration to prevent widening the gap between healthcare providers in different regions or socioeconomic environments.
Finally, practical barriers, such as the readiness of healthcare institutions, acceptance by clinical teams, and cost, must be carefully navigated to realize the full potential of AI in BPD management. Many hospitals lack the infrastructure, such as advanced computing resources or integrated data systems, to support AI applications. Even when the technology is available, there may be a lack of trained personnel to manage and interpret AI outputs.
Hospitals must invest in staff training and robust infrastructure to further AI research and applications. This may be costly, particularly for smaller hospitals or healthcare systems with limited budgets. Initiatives or policies that support equitable access to AI technologies, such as government funding or public–private partnerships, can aid in the broader adoption of AI technology. AI models should also fit seamlessly into existing clinical workflows, avoiding disruptions that could lead to inefficiencies or errors in care. Involving clinicians in developing and testing AI models can increase the trust and acceptance of the models by multidisciplinary clinical teams [37].

5.3. Future Directions and Research Opportunities

The continuous development of advanced DL models presents unprecedented opportunities for enhanced BPD risk prediction and personalized management. Building upon Mohsen et al.’s, 2022 demonstration of multimodal data integration, future research should pursue comprehensive, multidimensional approaches to predictive modeling [60]. A promising avenue involves designing prospective, multicenter randomized studies that compare traditional single-modal predictive models with integrated models combining chest radiograph transformer models, comprehensive intensive care unit clinical datasets, and genetic risk factor analyses.
Such research should aim to develop a predictive framework that integrates advanced DL processing of chest radiographs with detailed clinical data, with the specific objective of validating early-onset BPD risk assessment before 28 days of life. Inspired by personalized medicine approaches, like Liu et al.’s, 2019 radiotherapy optimization, researchers could develop longitudinal cohort studies incorporating genetic testing, comprehensive EHRs, and ML-driven treatment stratification [61]. This approach would involve creating a risk stratifier that analyzes genetic risk markers, physiological parameters, and historical treatment response data to generate individualized treatment risk scores and recommend tailored intervention strategies.
Genomic research represents another frontier for BPD predictive modeling. Large-scale multicenter genetic sequencing of preterm infants could enable the development of models capable of identifying high-risk genetic markers, predicting BPD progression, and recommending preventative interventions. Additionally, advanced imaging predictive techniques offer additional research opportunities to investigate transfer learning approaches in radiographic analysis, deep CNN customization, and automated lung development progression tracking. Researchers should develop standardized imaging protocols for preterm infant lung assessment and create ML models capable of predicting lung development trajectories with unprecedented precision.
These proposed research designs leverage the transformative potential of AI in personalized medicine. By integrating advanced computational techniques with comprehensive clinical datasets, researchers can develop more precise, individualized approaches to BPD management. However, the potential to revolutionize neonatal respiratory care through sophisticated ML approaches represents a critical advancement in pediatric medical research, promising more targeted, effective interventions for preterm infants at risk of BPD.

5.4. Limitations

This study has several limitations. Although studies included data from multiple countries, there was an overrepresentation of data from high-resource settings, particularly Korea and China. Future research should prioritize data collection from diverse healthcare environments to enhance model generalizability. Additionally, standardization remains a major challenge, as included studies utilized different datasets with variable feature selection, preprocessing steps, and outcome definitions. Harmonizing data across centers is necessary to improve AI model reproducibility and reliability. Selection bias and small sample sizes also impact model performance. Techniques such as oversampling, synthetic data generation, and transfer learning could help address these challenges.

5.5. Conclusions

Modern computational technology, such as AI and DL, have revolutionized how we understand BPD, in many cases, surpassing traditional diagnostic methods. Notable advancements include early diagnostic capabilities, with DL models predicting BPD risk up to four weeks earlier than clinical assessments, and integrated genetic and clinical models achieving AUC values above 0.85. Studies have identified the associations of BPD with genes involved in T cell development and have developed predictive models that use these gene markers with high AUCs. Other studies have identified unique clinical features, such as absent or reduced end-diastolic flow velocity in the umbilical artery as highly predictive features of BPD. Beyond risk stratification and early BPD prediction, researchers have leveraged AI to predict the course of BPD severity and post-extubation failure, and others have identified superior ventilation treatments for BPD patients. These tools offer personalized insights into disease mechanisms and interventions, leveraging data from EHRs, genetic testing, and radiological imaging for a more comprehensive approach to neonatal care. Despite these breakthroughs, challenges remain in standardization, external validation, and clinical translation. AI and ML models show promise for BPD prediction, but real-world application requires improvements in explainability and cross-institutional collaborations. Future efforts must prioritize generalizable models, diverse demographic representation, and interdisciplinary collaboration to fully realize the potential of computational approaches in improving neonatal respiratory care.

Author Contributions

Conceptualization, T.J. and A.G.M.; methodology, T.J., S.S. and A.G.M.; formal analysis, T.J., S.S. and A.G.M.; investigation, T.J., S.S. and A.G.M.; resources, A.G.M.; data curation, T.J., S.S. and A.G.M.; writing—original draft preparation, T.J. and S.S.; writing—review and editing, T.J., S.S., J.N. and A.G.M.; visualization, T.J. and S.S.; supervision, A.G.M.; project administration, A.G.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Northway, W.H.; Rosan, R.C.; Porter, D.Y. Pulmonary Disease Following Respirator Therapy of Hyaline-Membrane Disease: Bronchopulmonary Dysplasia. N. Engl. J. Med. 1967, 276, 357–368. [Google Scholar] [CrossRef] [PubMed]
  2. Thébaud, B.; Goss, K.N.; Laughon, M.; Whitsett, J.A.; Abman, S.H.; Steinhorn, R.H.; Aschner, J.L.; Davis, P.G.; McGrath-Morrow, S.A.; Soll, R.F.; et al. Bronchopulmonary Dysplasia. Nat. Rev. Dis. Primer 2019, 5, 78. [Google Scholar] [CrossRef]
  3. Dankhara, N.; Holla, I.; Ramarao, S.; Kalikkot Thekkeveedu, R. Bronchopulmonary Dysplasia: Pathogenesis and Pathophysiology. J. Clin. Med. 2023, 12, 4207. [Google Scholar] [CrossRef]
  4. Higano, N.S.; Spielberg, D.R.; Fleck, R.J.; Schapiro, A.H.; Walkup, L.L.; Hahn, A.D.; Tkach, J.A.; Kingma, P.S.; Merhar, S.L.; Fain, S.B.; et al. Neonatal Pulmonary Magnetic Resonance Imaging of Bronchopulmonary Dysplasia Predicts Short-Term Clinical Outcomes. Am. J. Respir. Crit. Care Med. 2018, 198, 1302–1311. [Google Scholar] [CrossRef]
  5. Younge, N.; Goldstein, R.F.; Bann, C.M.; Hintz, S.R.; Patel, R.M.; Smith, P.B.; Bell, E.F.; Rysavy, M.A.; Duncan, A.F.; Vohr, B.R.; et al. Survival and Neurodevelopmental Outcomes among Periviable Infants. N. Engl. J. Med. 2017, 376, 617–628. [Google Scholar] [CrossRef]
  6. Stoll, B.J.; Hansen, N.I.; Bell, E.F.; Walsh, M.C.; Carlo, W.A.; Shankaran, S.; Laptook, A.R.; Sánchez, P.J.; Van Meurs, K.P.; Wyckoff, M.; et al. Trends in Care Practices, Morbidity, and Mortality of Extremely Preterm Neonates, 1993–2012. JAMA 2015, 314, 1039–1051. [Google Scholar] [CrossRef]
  7. Jensen, E.A.; Dysart, K.; Gantz, M.G.; McDonald, S.; Bamat, N.A.; Keszler, M.; Kirpalani, H.; Laughon, M.M.; Poindexter, B.B.; Duncan, A.F.; et al. The Diagnosis of Bronchopulmonary Dysplasia in Very Preterm Infants. An Evidence-Based Approach. Am. J. Respir. Crit. Care Med. 2019, 200, 751–759. [Google Scholar] [CrossRef] [PubMed]
  8. Mestan, K.; Yu, Y.; Matoba, N.; Cerda, S.; Demmin, B.; Pearson, C.; Ortiz, K.; Wang, X. Placental Inflammatory Response Is Associated with Poor Neonatal Growth: Preterm Birth Cohort Study. Pediatrics 2010, 125, e891–e898. [Google Scholar] [CrossRef]
  9. Driscoll, D.N.; De Santi, C.; McKiernan, P.J.; McEneaney, V.; Molloy, E.J.; Greene, C.M. Expression of X-Linked Toll-like Receptor 4 Signaling Genes in Female vs. Male Neonates. Pediatr. Res. 2017, 81, 831–837. [Google Scholar] [CrossRef]
  10. Hartling, L.; Liang, Y.; Lacaze-Masmonteil, T. Chorioamnionitis as a Risk Factor for Bronchopulmonary Dysplasia: A Systematic Review and Meta-Analysis. Arch. Dis. Child. Fetal Neonatal Ed. 2012, 97, F8–F17. [Google Scholar] [CrossRef]
  11. Morrow, L.A.; Wagner, B.D.; Ingram, D.A.; Poindexter, B.B.; Schibler, K.; Cotten, C.M.; Dagle, J.; Sontag, M.K.; Mourani, P.M.; Abman, S.H. Antenatal Determinants of Bronchopulmonary Dysplasia and Late Respiratory Disease in Preterm Infants. Am. J. Respir. Crit. Care Med. 2017, 196, 364–374. [Google Scholar] [CrossRef] [PubMed]
  12. Wang, H.; St Julien, K.R.; Stevenson, D.K.; Hoffmann, T.J.; Witte, J.S.; Lazzeroni, L.C.; Krasnow, M.A.; Quaintance, C.C.; Oehlert, J.W.; Jelliffe-Pawlowski, L.L.; et al. A Genome-Wide Association Study (GWAS) for Bronchopulmonary Dysplasia. Pediatrics 2013, 132, 290–297. [Google Scholar] [CrossRef]
  13. Kwinta, P.; Bik-Multanowski, M.; Mitkowska, Z.; Tomasik, T.; Legutko, M.; Pietrzyk, J.J. Genetic Risk Factors of Bronchopulmonary Dysplasia. Pediatr. Res. 2008, 64, 682–688. [Google Scholar] [CrossRef] [PubMed]
  14. Higgins, R.D.; Jobe, A.H.; Koso-Thomas, M.; Bancalari, E.; Viscardi, R.M.; Hartert, T.V.; Ryan, R.M.; Kallapur, S.G.; Steinhorn, R.H.; Konduri, G.G.; et al. Bronchopulmonary Dysplasia: Executive Summary of a Workshop. J. Pediatr. 2018, 197, 300–308. [Google Scholar] [CrossRef] [PubMed]
  15. Shah, J.; Jefferies, A.L.; Yoon, E.W.; Lee, S.K.; Shah, P.S.; Canadian Neonatal Network. Risk Factors and Outcomes of Late-Onset Bacterial Sepsis in Preterm Neonates Born at <32 Weeks’ Gestation. Am. J. Perinatol. 2015, 32, 675–682. [Google Scholar] [CrossRef]
  16. Tyson, J.E.; Wright, L.L.; Oh, W.; Kennedy, K.A.; Mele, L.; Ehrenkranz, R.A.; Stoll, B.J.; Lemons, J.A.; Stevenson, D.K.; Bauer, C.R.; et al. Vitamin A Supplementation for Extremely-Low-Birth-Weight Infants. National Institute of Child Health and Human Development Neonatal Research Network. N. Engl. J. Med. 1999, 340, 1962–1968. [Google Scholar] [CrossRef]
  17. Oh, W.; Poindexter, B.B.; Perritt, R.; Lemons, J.A.; Bauer, C.R.; Ehrenkranz, R.A.; Stoll, B.J.; Poole, K.; Wright, L.L.; Neonatal Research Network. Association between Fluid Intake and Weight Loss during the First Ten Days of Life and Risk of Bronchopulmonary Dysplasia in Extremely Low Birth Weight Infants. J. Pediatr. 2005, 147, 786–790. [Google Scholar] [CrossRef]
  18. Ehrenkranz, R.A.; Dusick, A.M.; Vohr, B.R.; Wright, L.L.; Wrage, L.A.; Poole, W.K. Growth in the Neonatal Intensive Care Unit Influences Neurodevelopmental and Growth Outcomes of Extremely Low Birth Weight Infants. Pediatrics 2006, 117, 1253–1261. [Google Scholar] [CrossRef]
  19. Clyman, R.I.; Liebowitz, M.; Kaempf, J.; Erdeve, O.; Bulbul, A.; Håkansson, S.; Lindqvist, J.; Farooqi, A.; Katheria, A.; Sauberan, J.; et al. PDA-TOLERATE Trial: An Exploratory Randomized Controlled Trial of Treatment of Moderate-to-Large Patent Ductus Arteriosus at 1 Week of Age. J. Pediatr. 2019, 205, 41–48.e6. [Google Scholar] [CrossRef]
  20. Lavoie, P.M.; Pham, C.; Jang, K.L. Heritability of Bronchopulmonary Dysplasia, Defined According to the Consensus Statement of the National Institutes of Health. Pediatrics 2008, 122, 479–485. [Google Scholar] [CrossRef]
  21. Yu, K.-H.; Li, J.; Snyder, M.; Shaw, G.M.; O’Brodovich, H.M. The Genetic Predisposition to Bronchopulmonary Dysplasia. Curr. Opin. Pediatr. 2016, 28, 318–323. [Google Scholar] [CrossRef]
  22. Lal, C.V.; Bhandari, V.; Ambalavanan, N. Genomics, Microbiomics, Proteomics, and Metabolomics in Bronchopulmonary Dysplasia. Semin. Perinatol. 2018, 42, 425–431. [Google Scholar] [CrossRef]
  23. Hoshino, Y.; Arai, J.; Cho, K.; Yukitake, Y.; Kajikawa, D.; Hinata, A.; Miura, R. Diagnosis and Management of Neonatal Respiratory Distress Syndrome in Japan: A National Survey. Pediatr. Neonatol. 2023, 64, 61–67. [Google Scholar] [CrossRef] [PubMed]
  24. Popova, A.P. Mechanisms of Bronchopulmonary Dysplasia. J. Cell Commun. Signal. 2013, 7, 119–127. [Google Scholar] [CrossRef] [PubMed]
  25. Schmidt, B.; Roberts, R.S.; Davis, P.; Doyle, L.W.; Barrington, K.J.; Ohlsson, A.; Solimano, A.; Tin, W. Caffeine Therapy for Apnea of Prematurity. N. Engl. J. Med. 2006, 354, 2112–2121. [Google Scholar] [CrossRef] [PubMed]
  26. Zapata, H.A.; Fort, P.; Roberts, K.D.; Kaluarachchi, D.C.; Guthrie, S.O. Surfactant Administration Through Laryngeal or Supraglottic Airways (SALSA): A Viable Method for Low-Income and Middle-Income Countries. Front. Pediatr. 2022, 10, 853831. [Google Scholar] [CrossRef]
  27. Kribs, A.; Roberts, K.D.; Trevisanuto, D.; O’Donnell, C.; Dargaville, P.A. Surfactant Delivery Strategies to Prevent Bronchopulmonary Dysplasia. Semin. Perinatol. 2023, 47, 151813. [Google Scholar] [CrossRef]
  28. Wang, X.; Shi, Y. An Updated Review on Systemic Glucocorticoids in the Prevention and Treatment of Bronchopulmonary Dysplasia. Pediatr. Discov. 2023, 1, e6. [Google Scholar] [CrossRef]
  29. Homan, T.D.; Nayak, R.P. Short- and Long-Term Complications of Bronchopulmonary Dysplasia. Respir. Care 2021, 66, 1618–1629. [Google Scholar] [CrossRef]
  30. Resch, B.; Kurath-Koller, S.; Eibisberger, M.; Zenz, W. Prematurity and the Burden of Influenza and Respiratory Syncytial Virus Disease. World J. Pediatr. WJP 2016, 12, 8–18. [Google Scholar] [CrossRef]
  31. Smith, V.C.; Zupancic, J.A.F.; McCormick, M.C.; Croen, L.A.; Greene, J.; Escobar, G.J.; Richardson, D.K. Rehospitalization in the First Year of Life among Infants with Bronchopulmonary Dysplasia. J. Pediatr. 2004, 144, 799–803. [Google Scholar] [CrossRef] [PubMed]
  32. Collaco, J.M.; McGrath-Morrow, S.A. Respiratory Phenotypes for Preterm Infants, Children, and Adults: Bronchopulmonary Dysplasia and More. Ann. Am. Thorac. Soc. 2018, 15, 530–538. [Google Scholar] [CrossRef]
  33. MacLean, J.E.; DeHaan, K.; Fuhr, D.; Hariharan, S.; Kamstra, B.; Hendson, L.; Adatia, I.; Majaesic, C.; Lovering, A.T.; Thompson, R.B.; et al. Altered Breathing Mechanics and Ventilatory Response during Exercise in Children Born Extremely Preterm. Thorax 2016, 71, 1012–1019. [Google Scholar] [CrossRef]
  34. Vom Hove, M.; Prenzel, F.; Uhlig, H.H.; Robel-Tillig, E. Pulmonary Outcome in Former Preterm, Very Low Birth Weight Children with Bronchopulmonary Dysplasia: A Case-Control Follow-up at School Age. J. Pediatr. 2014, 164, 40–45.e4. [Google Scholar] [CrossRef] [PubMed]
  35. Bui, D.S.; Lodge, C.J.; Burgess, J.A.; Lowe, A.J.; Perret, J.; Bui, M.Q.; Bowatte, G.; Gurrin, L.; Johns, D.P.; Thompson, B.R.; et al. Childhood Predictors of Lung Function Trajectories and Future COPD Risk: A Prospective Cohort Study from the First to the Sixth Decade of Life. Lancet Respir. Med. 2018, 6, 535–544. [Google Scholar] [CrossRef]
  36. Hamet, P.; Tremblay, J. Artificial Intelligence in Medicine. Metabolism 2017, 69S, S36–S40. [Google Scholar] [CrossRef]
  37. Bajwa, J.; Munir, U.; Nori, A.; Williams, B. Artificial Intelligence in Healthcare: Transforming the Practice of Medicine. Future Healthc. J. 2021, 8, e188–e194. [Google Scholar] [CrossRef] [PubMed]
  38. Sharma, M.; Savage, C.; Nair, M.; Larsson, I.; Svedberg, P.; Nygren, J.M. Artificial Intelligence Applications in Health Care Practice: Scoping Review. J. Med. Internet Res. 2022, 24, e40238. [Google Scholar] [CrossRef]
  39. Hirani, R.; Noruzi, K.; Khuram, H.; Hussaini, A.S.; Aifuwa, E.I.; Ely, K.E.; Lewis, J.M.; Gabr, A.E.; Smiley, A.; Tiwari, R.K.; et al. Artificial Intelligence and Healthcare: A Journey through History, Present Innovations, and Future Possibilities. Life 2024, 14, 557. [Google Scholar] [CrossRef]
  40. Yu, V.L.; Buchanan, B.G.; Shortliffe, E.H.; Wraith, S.M.; Davis, R.; Scott, A.C.; Cohen, S.N. Evaluating the Performance of a Computer-Based Consultant. Comput. Programs Biomed. 1979, 9, 95–102. [Google Scholar] [CrossRef]
  41. Jiang, F.; Jiang, Y.; Zhi, H.; Dong, Y.; Li, H.; Ma, S.; Wang, Y.; Dong, Q.; Shen, H.; Wang, Y. Artificial Intelligence in Healthcare: Past, Present and Future. Stroke Vasc. Neurol. 2017, 2, 230–243. [Google Scholar] [CrossRef] [PubMed]
  42. Mazurowski, M.A.; Buda, M.; Saha, A.; Bashir, M.R. Deep Learning in Radiology: An Overview of the Concepts and a Survey of the State of the Art with Focus on MRI. J. Magn. Reson. Imaging JMRI 2019, 49, 939–954. [Google Scholar] [CrossRef] [PubMed]
  43. McBee, M.P.; Awan, O.A.; Colucci, A.T.; Ghobadi, C.W.; Kadom, N.; Kansagra, A.P.; Tridandapani, S.; Auffermann, W.F. Deep Learning in Radiology. Acad. Radiol. 2018, 25, 1472–1480. [Google Scholar] [CrossRef]
  44. Rahaman, M.J. A Comprehensive Review to Understand the Definitions, Advantages, Disadvantages and Applications of Machine Learning Algorithms. IJCA 2024, 186, 43–47. [Google Scholar] [CrossRef]
  45. Lei, J.; Sun, T.; Jiang, Y.; Wu, P.; Fu, J.; Zhang, T.; McGrath, E. Risk Identification of Bronchopulmonary Dysplasia in Premature Infants Based on Machine Learning. Front. Pediatr. 2021, 9, 719352. [Google Scholar] [CrossRef]
  46. Moreira, A.; Tovar, M.; Smith, A.M.; Lee, G.C.; Meunier, J.A.; Cheema, Z.; Moreira, A.; Winter, C.; Mustafa, S.B.; Seidner, S.; et al. Development of a Peripheral Blood Transcriptomic Gene Signature to Predict Bronchopulmonary Dysplasia. AJP Lung Cell. Mol. Physiol. 2022, 324, L76–L87. [Google Scholar] [CrossRef]
  47. Montagna, S.; Magno, D.; Ferretti, S.; Stelluti, M.; Gona, A.; Dionisi, C.; Simonazzi, G.; Martini, S.; Corvaglia, L.; Aceti, A. Combining Artificial Intelligence and Conventional Statistics to Predict Bronchopulmonary Dysplasia in Very Preterm Infants Using Routinely Collected Clinical Variables. Pediatr. Pulmonol. 2024, 59, 3400–3409. [Google Scholar] [CrossRef]
  48. Moreira, A.G.; Arora, T.; Arya, S.; Winter, C.; Valadie, C.T.; Kwinta, P. Leveraging Transcriptomics to Develop Bronchopulmonary Dysplasia Endotypes: A Concept Paper. Respir. Res. 2023, 24, 284. [Google Scholar] [CrossRef] [PubMed]
  49. Jassem-Bobowicz, J.M.; Klasa-Mazurkiewicz, D.; Żawrocki, A.; Stefańska, K.; Domżalska-Popadiuk, I.; Kwiatkowski, S.; Preis, K. Prediction Model for Bronchopulmonary Dysplasia in Preterm Newborns. Children 2021, 8, 886. [Google Scholar] [CrossRef]
  50. Verder, H.; Heiring, C.; Ramanathan, R.; Scoutaris, N.; Verder, P.; Jessen, T.E.; Höskuldsson, A.; Bender, L.; Dahl, M.; Eschen, C.; et al. Bronchopulmonary Dysplasia Predicted at Birth by Artificial Intelligence. Acta Paediatr. 2021, 110, 503–509. [Google Scholar] [CrossRef]
  51. Dai, D.; Chen, H.; Dong, X.; Chen, J.; Mei, M.; Lu, Y.; Yang, L.; Wu, B.; Cao, Y.; Wang, J.; et al. Bronchopulmonary Dysplasia Predicted by Developing a Machine Learning Model of Genetic and Clinical Information. Front. Genet. 2021, 12, 689071. [Google Scholar] [CrossRef] [PubMed]
  52. Gao, L.; Yang, P.; Luo, C.; Lei, M.; Shi, Z.; Cheng, X.; Zhang, J.; Cao, W.; Ren, M.; Zhang, L.; et al. Machine Learning Predictive Models for Grading Bronchopulmonary Dysplasia: Umbilical Cord Blood IL-6 as a Biomarker. Front. Pediatr. 2023, 11, 1301376. [Google Scholar] [CrossRef]
  53. Chou, H.-Y.; Lin, Y.-C.; Hsieh, S.-Y.; Chou, H.-H.; Lai, C.-S.; Wang, B.; Tsai, Y.-S. Deep Learning Model for Prediction of Bronchopulmonary Dysplasia in Preterm Infants Using Chest Radiographs. J. Imaging Inform. Med. 2024, 37, 2063–2073. [Google Scholar] [CrossRef]
  54. Leigh, R.M.; Pham, A.; Rao, S.S.; Vora, F.M.; Hou, G.; Kent, C.; Rodriguez, A.; Narang, A.; Tan, J.B.C.; Chou, F.-S. Machine Learning for Prediction of Bronchopulmonary Dysplasia-Free Survival among Very Preterm Infants. BMC Pediatr. 2022, 22, 542. [Google Scholar] [CrossRef]
  55. Xing, W.; He, W.; Li, X.; Chen, J.; Cao, Y.; Zhou, W.; Shen, Q.; Zhang, X.; Ta, D. Early Severity Prediction of BPD for Premature Infants from Chest X-Ray Images Using Deep Learning: A Study at the 28th Day of Oxygen Inhalation. Comput. Methods Programs Biomed. 2022, 221, 106869. [Google Scholar] [CrossRef]
  56. Wu, T.-Y.; Lin, W.-T.; Chen, Y.-J.; Chang, Y.-S.; Lin, C.-H.; Lin, Y.-J. Machine Learning to Predict Late Respiratory Support in Preterm Infants: A Retrospective Cohort Study. Sci. Rep. 2023, 13, 2839. [Google Scholar] [CrossRef]
  57. Tao, Y.; Ding, X.; Guo, W. Using Machine-Learning Models to Predict Extubation Failure in Neonates with Bronchopulmonary Dysplasia. BMC Pulm. Med. 2024, 24, 308. [Google Scholar] [CrossRef]
  58. Baowaly, M.K.; Lin, C.-C.; Liu, C.-L.; Chen, K.-T. Synthesizing Electronic Health Records Using Improved Generative Adversarial Networks. J. Am. Med. Inform. Assoc. JAMIA 2018, 26, 228–241. [Google Scholar] [CrossRef]
  59. Khan, B.; Fatima, H.; Qureshi, A.; Kumar, S.; Hanan, A.; Hussain, J.; Abdullah, S. Drawbacks of Artificial Intelligence and Their Potential Solutions in the Healthcare Sector. Biomed. Mater. Devices 2023, 1, 731–738. [Google Scholar] [CrossRef]
  60. Mohsen, F.; Ali, H.; El Hajj, N.; Shah, Z. Artificial Intelligence-Based Methods for Fusion of Electronic Health Records and Imaging Data. Sci. Rep. 2022, 12, 17981. [Google Scholar] [CrossRef]
  61. Liu, W.; Liu, A.; Chan, J.; Boldt, R.G.; Munoz-Schuffenegger, P.; Louie, A.V. What Is the Optimal Radiotherapy Utilization Rate for Lung Cancer?—A Systematic Review. Transl. Lung Cancer Res. 2019, 8 (Suppl. S2), S163–S171. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Machine learning model workflow.
Figure 1. Machine learning model workflow.
Information 16 00262 g001
Table 1. Risk factors for the development of bronchopulmonary dysplasia in neonates.
Table 1. Risk factors for the development of bronchopulmonary dysplasia in neonates.
CategorySpecific Factors
Perinatal FactorsExtreme prematurity (<28 weeks gestational age) [6]
Very low birth weight (<1500 g) [7]
Intrauterine growth restriction [8]
Male sex [9]
Multiple gestation [6]
Chorioamnionitis [10]
Preeclampsia [11]
Genetic FactorsFamily history of asthma or atopy [12]
Genetic polymorphisms (TGF-β, IL-4, IL-10) [13]
Ancestry-specific genetic variants [6]
Respiratory FactorsRespiratory distress syndrome [6]
Mechanical ventilation > 7 days [7]
High oxygen requirements (FiO2 > 0.4) [14]
Ventilator-induced lung injury [6]
Patent ductus arteriosus [19]
Inflammatory FactorsSystemic inflammation [6]
Early-onset sepsis [6]
Late-onset sepsis [15]
Elevated inflammatory markers (IL-6, IL-8, TNF-α) [22]
Nutritional FactorsVitamin A deficiency [16]
Fluid overload in first week of life [17]
Inadequate protein intake [6]
Poor postnatal growth [18]
Table 2. Subfields of artificial intelligence and healthcare applications.
Table 2. Subfields of artificial intelligence and healthcare applications.
AI TypeCore MethodologyExamples of Healthcare
Applications
Machine Learning [37]Pattern recognition from provided features about the dataDisease prediction
Risk stratification
Treatment response forecasting
Deep Learning [37]Leveraging complex neural network-based architectures to identify features and make predictionsMedical image analysis
Radiology diagnostics
Genomic interpretation
Expert Systems [38]Using rule-based reasoning to provide solutions in specific domainsClinical decision support
Diagnostic assistance
Treatment recommendation
Natural Language Processing [39]Linguistic processing and feature extraction of human language to analyze, classify, or generate textElectronic health record analysis
Clinical documentation
Patient communication
Table 3. Summarization of artificial intelligence algorithms, clinical use cases, strengths, and weaknesses.
Table 3. Summarization of artificial intelligence algorithms, clinical use cases, strengths, and weaknesses.
AI ModelUse CasesStrengthsWeaknesses
Logistic Regression [44]Binary outcomes (e.g., mortality, BPD diagnosis)Simple to interpret; works well with small datasets.Limited to linear relationships; struggles with complex interactions.
Random Forest (RF) [44]Classifying neonates by risk level or predicting categorical outcomes (e.g., sepsis risk)Handles non-linear data well; robust to overfitting; interpretable via feature importance.Computationally intensive with large datasets; less transparent than simpler models.
XGBoost [44]Predicting rare outcomes (e.g., long-term complications) or improving accuracy on structured dataHigh accuracy; robust with imbalanced data; interpretable using SHAP values.Requires parameter tuning; can be computationally expensive.
CatBoost [44]When dataset has categorical features (e.g., feeding methods, medications)Optimized for categorical variables; fast and handles missing data.Limited documentation compared to other models; requires careful preprocessing.
Support Vector Machine (SVM) [44]Identifying patterns in small, high-dimensional datasets (e.g., gene expression for BPD)Effective in high-dimensional spaces; works well with limited data.Difficult to interpret; computationally expensive with large datasets.
Neural Networks (NNs) [44]Complex problems with large datasets (e.g., image analysis, time-series prediction)Capture complex non-linear relationships; adaptable to various data types.Require large datasets; lack interpretability; prone to overfitting without proper tuning.
k-Nearest Neighbors (kNNs) [44]Simple classification tasks (e.g., identifying patient subgroups)Easy to implement; no training phase; work well with small datasets.Computationally expensive during prediction; sensitive to irrelevant features.
Clustering [44]Identifying subgroups in neonates (e.g., phenotyping BPD or sepsis)Unsupervised learning; helps explore hidden patterns; simple to implement.Requires pre-specification of cluster numbers; sensitive to outliers.
Linear Discriminant Analysis (LDA) [44]Classifying outcomes with clear group separations (e.g., birthweight categories)Simple and interpretable; works well with small datasets.Assumes linear separability; limited use with complex data.
Recurrent Neural Networks (RNNs) [44]Time series predictions (e.g., vital signs monitoring for sepsis or NEC)Capture temporal patterns; useful for sequential data.Require large datasets; risk of vanishing gradients; computationally expensive.
Convolutional Neural Networks (CNNs) [44]Image-based tasks (e.g., detecting pneumothorax or brain abnormalities in imaging)Excellent for image analysis; captures spatial relationships well.Require large datasets; limited interpretability; resource-intensive training.
Ensemble Models [44]Combining predictions from multiple models for tasks like risk scoringImprove accuracy and robustness; reduce overfitting.Can be complex to implement; less interpretable than single models.
Table 4. Predictive ML models for BPD diagnosis.
Table 4. Predictive ML models for BPD diagnosis.
StudyGeographic AreaSample SizeAlgorithms
Employed
Notable FeaturesValidationPerformance Metrics
Verder et al. [50]Denmark61SVMFTIR spectral analysisTraining, test split88% specificity
91% sensitivity
Dai et al. [51]China245LASSOHigh risk genes
OBSL1, GNAS,
TCIRG1, C5,
and others
10-fold CV0.907 AUC, severe BPD
0.915 AUC, mild BPD
Gao et al. [52]China414XGBoost,
CatBoost,
Light GBM,
RF
UCB-IL610-fold CV0.870 AUC
Chou et al. [53]Taiwan380U-Net, ResNetChest radiograph images5-fold CV0.80 F1 score
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jha, T.; Suhail, S.; Northcote, J.; Moreira, A.G. Artificial Intelligence in Bronchopulmonary Dysplasia: A Review of the Literature. Information 2025, 16, 262. https://doi.org/10.3390/info16040262

AMA Style

Jha T, Suhail S, Northcote J, Moreira AG. Artificial Intelligence in Bronchopulmonary Dysplasia: A Review of the Literature. Information. 2025; 16(4):262. https://doi.org/10.3390/info16040262

Chicago/Turabian Style

Jha, Tony, Sana Suhail, Janet Northcote, and Alvaro G. Moreira. 2025. "Artificial Intelligence in Bronchopulmonary Dysplasia: A Review of the Literature" Information 16, no. 4: 262. https://doi.org/10.3390/info16040262

APA Style

Jha, T., Suhail, S., Northcote, J., & Moreira, A. G. (2025). Artificial Intelligence in Bronchopulmonary Dysplasia: A Review of the Literature. Information, 16(4), 262. https://doi.org/10.3390/info16040262

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop