1. Introduction
With the rapid advancement of information technology, artificial intelligence (AI) is driving transformative changes in the healthcare sector by leveraging its unparalleled capabilities in data processing, pattern recognition, and intelligent decision-making [
1,
2,
3]. The diagnosis of infectious diseases continues to face significant challenges, including the rapid emergence of novel pathogens, the complexity of differential diagnosis, and the urgent need for speed in outbreak containment [
4,
5]. By simulating core aspects of human cognition, AI technology enables in-depth mining and precise interpretation of medical and microbiological data, providing unprecedented perspectives and powerful analytical tools for accurate pathogen identification, antibiotic resistance prediction, and epidemic trend assessment [
6,
7,
8]. Amidst the global maldistribution of medical resources, the pressing need to enhance diagnostic accuracy for complex infections, and escalating pandemic threats, AI-assisted diagnostic technology demonstrates considerable value and transformative potential [
9]. These applications span critical areas: from the rapid identification of pathogens via genomic sequencing to the automated analysis of medical imaging; and from intelligent POCT systems to real-time epidemic surveillance networks [
10,
11,
12]. Consequently, AI has transcended traditional diagnostic paradigms by enhancing data processing efficiency and recognizing complex patterns. Its unique advantages offer promising solutions to persistent clinical challenges, propelling infectious disease diagnosis toward a more efficient, precise, and proactive future [
13,
14,
15].
The application of artificial intelligence in infectious disease diagnostics has gained considerable scholarly attention in recent years, with a growing body of review literature contributing to the field. While previous reviews have often provided in-depth examinations of specific technological domains—such as medical imaging analysis or high-throughput sequencing—and established a solid knowledge base, there remains a need for comprehensive analyses that integrate multiple technological approaches. This review establishes a systematic framework that brings together three key technological directions—high-throughput sequencing, medical imaging, and point-of-care testing—and introduces a dedicated new section on Available Databases for AI-Driven Infectious Disease Diagnostics. This section surveys crucial public data resources spanning genomics, medical imaging, and clinical records, which are foundational for training and validating robust AI models. Alongside a synthesis of technological developments and these data resources, this review systematically addresses critical challenges in clinical translation—such as data quality, model generalizability, economic viability, and regulatory considerations. It is hoped that this work will provide researchers with a structured and insightful reference that balances breadth with critical depth.
2. AI in High-Throughput Sequencing for Pathogen Surveillance
The integration of real-time genomic and epidemiological surveillance is crucial for the rapid diagnosis, tracking, and control of infectious disease outbreaks. By enabling rapid analysis of massive sequencing datasets, AI technology facilitates pathogen identification, variant monitoring, and outbreak investigation, thereby transforming this field.
2.1. Pathogen Identification and Characterization
AI plays a critical role in high-throughput sequencing (HTS)-driven pathogen research, demonstrating particular strength in pathogen identification, drug resistance analysis, and quality control of sequencing data [
16,
17]. For instance, Christians et al. established an AI-driven statistical model for antimicrobial resistance (AMR) that accurately traces detected resistance genes (such as mecA, vanA, and blaCTX-M) to specific bacterial pathogens using probabilistic inference, thereby enabling culture-independent prediction of phenotypic resistance. This study also extensively incorporated in silico validation methods, demonstrating the utility and reliability of AI in the high-throughput evaluation of detection limits, as well as primer inclusivity and exclusivity [
18].
Furthermore, the AI model developed by Lin et al., which utilizes a Ct-guided nine-state Markov process based on viral Ct values, illustrates how temporal biological data can be leveraged to accurately quantify infection dynamics. Capable of reconstructing full viral kinetic trajectories from incomplete real-world observations, this model has been used to characterize transmission patterns of the Alpha and Omicron variants, estimating key epidemiological parameters such as the basic reproduction number and generation interval. It has also informed the design of targeted containment strategies. This methodology extracts meaningful patterns from high-dimensional temporal data to support pathogen surveillance and decision-making, offering a valuable framework for utilizing HTS data in pathogen identification, variant monitoring, and transmissibility assessment [
19].
Machine learning demonstrates extensive application prospects in pathogen classification/identification and multi-omics integration analysis. For example, support vector machines (SVM), random forest algorithms, and k-mer-based analytical methods enable the efficient classification and identification of microbial gene sequences (such as 16S rRNA), thereby facilitating the rapid detection of pathogens including bacteria and viruses. Moreover, artificial intelligence enables the integrative analysis of multi-omics data (spanning genomic, metabolomic, and transcriptomic domains) to elucidate underlying pathogen–host interaction mechanisms, forecast viral adaptive evolution, and identify novel therapeutic targets such as antimicrobial peptides [
20,
21]. Given the limitations of conventional bioinformatics in handling the enormous volume of data produced by high-throughput sequencing, AI technologies have emerged as essential tools for modern microbiological research. Specifically, deep learning methods significantly accelerate genomic analysis through rapid alignment against extensive microbial databases, facilitating the detection of previously uncharacterized pathogens, even those with low sequence homology to known species. Notably, during the COVID-19 pandemic, AI played a critical role in the swift identification of SARS-CoV-2 from clinical samples. Beyond detection, AI can also predict critical pathogen characteristics, such as virulence, transmissibility, and host range, based on genomic data, thereby providing crucial support for risk assessment and public health decision-making [
22] (
Figure 1).
Moreover, the utility of artificial intelligence is especially pronounced in mitigating challenges inherent to sequencing technologies. A notable example lies in addressing the elevated error rates associated with Oxford Nanopore sequencing: both the PrimalSeq wet-lab protocol and the iVar computational pipeline leverage machine learning strategies, utilizing a logistic regression classifier that incorporates mutation frequency and strand bias information to effectively suppress false positive variant calls. This approach offers a tangible framework for enhancing accuracy in long-read data characterized by high error rates. Although not yet a complete solution for data quality control, it underscores the potential of AI-driven methods to significantly improve the reliability of genomic analyses [
23].
In summary, artificial intelligence has become an indispensable component across multiple critical domains of high-throughput sequencing-based pathogen research, encompassing antimicrobial resistance gene tracking, data quality control, pathogen identification, and multi-omics integration, thereby offering advanced computational tools for data analysis and predictive modeling in infectious disease studies. Despite these advancements, the clinical translation of AI-powered HTS analysis faces practical hurdles. The dependency on high-quality sequencing data means that performance can be significantly impacted by sample degradation or low pathogen load, which is common in clinical specimens. Furthermore, the computational complexity of some deep learning models may hinder their implementation in time-sensitive diagnostic workflows or resource-limited settings where rapid turnaround is critical. Establishing standardized benchmarks and ensuring computational efficiency are therefore essential next steps.
2.2. Outbreak Investigation and Transmission Tracing
AI plays a critical role in HTS-supported outbreak investigation and transmission tracing, significantly enhancing the speed, precision, and intelligence of epidemic response [
24]. By integrating genomic data with multisource epidemiological information, AI not only enables efficient pathogen identification and variant monitoring but also demonstrates powerful capabilities in reconstructing transmission chains and predicting evolutionary dynamics [
25]. A representative case is a study utilizing nanopore sequencing (MinION) and Hidden Markov Models (HMM), which successfully achieved real-time genomic surveillance and transmission chain tracing of Ebola virus in resource-limited settings. The bioinformatic algorithm processed raw electrical signal data to effectively identify single nucleotide variations (SNVs), significantly enhancing genotyping accuracy under high-error-rate conditions, thereby providing reliable real-time tracing support for field epidemic investigations [
26]. Another study developed a CNN-BiLSTM deep learning model for classifying SARS-CoV-2 genomic sequences and identifying key regulatory motifs. Demonstrating significant outperformance over conventional methods with 99.95% accuracy and 100% AUC ROC metrics, this model rapidly distinguishes SARS-CoV-2 from other coronaviruses, highlighting AI’s substantial potential in precisely identifying pathogen-characteristic sequences within high-throughput genomic data [
27]. Expanding further to large-scale surveillance practices, a study established a high-throughput SARS-CoV-2 genomic surveillance network, achieving near-real-time viral sequencing and multi-source data integration. This system leveraged cloud computing platforms (e.g., CLIMB) to automate data processing, standardize lineage typing, and integrate clinical information, thereby providing a robust foundation for implementing AI algorithms in transmission chain reconstruction, variant dynamics monitoring, and spatiotemporal modeling, significantly enhancing the precision of outbreak traceability and response capabilities [
28].
It is important to note that the effectiveness of these AI-driven phylogenetic approaches is contingent upon sufficient genomic sampling from the outbreak; undersampled outbreaks may lead to incomplete or biased transmission chain reconstruction. In summary, through approaches such as Bayesian inference and machine learning-based clustering, AI facilitates the detection of fine-scale genetic variations and integrates multidimensional metadata, enabling accurate reconstruction of transmission chains and assessment of variant risk. It has thus emerged as an integral component of modern infectious disease surveillance systems and precision public health interventions [
29,
30].
3. AI-Driven Diagnostic Imaging Analysis
High-resolution imaging has revolutionized microbiological and pathological diagnostics by enabling rapid and detailed visualization of pathogenic organisms, thereby significantly enhancing the precision and efficiency of infectious disease diagnosis. The integration of AI, particularly through deep learning-based computer vision algorithms, has further augmented these capabilities, enabling automated, high-throughput analysis of complex diagnostic images with expert-level accuracy.
3.1. Digital Microscopy Images
AI-powered diagnostic imaging analysis is progressively transforming conventional malaria detection methods, demonstrating particular advantages in the automated identification and classification of microscope images. AI approaches, primarily deep learning and convolutional neural networks (CNNs), can efficiently process both thin and thick blood smear images to achieve automated detection, classification, and quantification of plasmodium parasites, significantly enhancing diagnostic accuracy and efficiency [
31]. Furthermore, AI has not only demonstrated exceptional performance in model accuracy but also proven its substantial value in practical applications. Integrated with smartphone and mobile device-based imaging analysis systems, it provides low-cost, easily deployable malaria screening tools for resource-limited regions, facilitating large-scale disease screening and real-time diagnosis. Moreover, such technology is not limited to malaria but also provides a transferable framework for image-assisted diagnosis of other infectious diseases. Studies have demonstrated that object detection models including YOLOv4, Faster R-CNN, and SSD300 have been successfully applied to the identification and localization of malaria-infected red blood cells in thin blood smear images. Notably, YOLOv4 has demonstrated outstanding performance across multiple evaluation metrics, achieving precision, recall, F1-score, and mean average precision (mAP) of 83%, 95%, 89%, and 93.87%, respectively, at an IoU threshold of 0.5, while exhibiting robust cross-dataset generalization capability. This approach enables end-to-end cell detection and cropping, eliminating the dependency on single-cell image preprocessing, thereby providing a reliable foundation for subsequent parasite species identification and infection stage analysis [
32].
A key challenge for deployment, however, lies in the need for consistent slide preparation and imaging conditions, as variations in staining and focus can adversely affect model performance, highlighting a dependency on standardized operational procedures. With the continuous evolution of technology, AI-based microscopic image analysis systems are becoming integral components of modern medical diagnostics. Their breakthroughs in automation, processing efficiency, and accessibility are poised to exert a profound impact on global malaria prevention and control, as well as the rapid diagnosis of other infectious diseases.
3.2. Radiological Images
The urgent need for rapid and accurate diagnostic technologies during the COVID-19 pandemic has profoundly highlighted the value of AI in medical imaging analysis. Numerous studies have systematically reviewed the application of AI in the radiological diagnosis of COVID-19-induced acute respiratory distress syndrome (ARDS) and pneumonia, spanning the entire workflow from image acquisition and automated segmentation to diagnostic classification and prognostic assessment. Utilizing primarily chest CT, chest X-ray (CXR), and lung ultrasound modalities, AI technologies—leveraging deep learning (e.g., U-Net, CNN, ResNet, DRENet), transfer learning, machine learning, and hybrid models integrating traditional radiomics—have achieved precise segmentation of lung and infected areas, differential diagnosis between COVID-19 and other types of pneumonia, as well as quantitative assessment of disease severity and patient outcomes [
33,
34,
35]. Studies have demonstrated that AI not only efficiently extracts imaging features (such as ground-glass opacity, GGO) and achieves high accuracy in multiple experiments (e.g., AUC up to 0.95, sensitivity and specificity exceeding 90%), but also exhibits excellent interpretability by visualizing critical lesion regions through attention mechanisms, thereby providing reliable decisionsupport for clinicians. Furthermore, AI systems can integrate clinical indicators (e.g., age, comorbidities, laboratory results) to enable early prediction of severe disease risk and assessment of recovery time, thereby supporting optimized allocation of medical resources and stratified patient management. These capabilities demonstrate significant potential for enhancing diagnostic efficiency and reducing healthcare-associated infection risks. Although current challenges such as limited sample sizes, inter-protocol variations in imaging, and model generalizability persist, AI technology has demonstrated significant application value and broad prospects in radiological diagnosis for both COVID-19 and future public health emergencies [
36,
37,
38,
39,
40,
41] (
Figure 2).
Clearly, AI has played multifaceted roles in COVID-19 image recognition, quantitative assessment, and prognosis prediction through multimodal and multi-architecture integration strategies, significantly enhancing the comprehensive capabilities of radiological diagnosis. With ongoing algorithm optimization and deeper clinical integration, AI is poised to become an increasingly critical tool in infectious disease control and precision medicine. Notwithstanding the promising results, the generalizability of AI models for medical imaging remains a primary concern. Models trained on data from specific hospitals or scanner protocols often experience performance degradation when applied to external datasets, a phenomenon known as domain shift. This limitation questions the readiness of many reported models for broad clinical deployment without extensive external validation. Additionally, the integration of AI tools into radiologists’ workflow requires careful design to avoid alert fatigue and ensure that AI outputs are actionable, which is an area needing more human-factor studies.
3.3. Artificial Intelligence for Nuclear Analysis (AINU)
A paradigm shift in diagnostic technology is underway, centered on moving from indirect evidence to direct observation of host cell responses. While the previous sections have discussed AI applications that enhance the detection of the pathogen itself (e.g., via sequencing or imaging) or its biomarkers (e.g., via POCT), this section introduces a fundamentally different paradigm. Artificial Intelligence for Nuclear Analysis (AINU) exemplifies a shift towards detecting the infection through the earliest morphological responses of the host cells. Traditional diagnosis of viral infections relies on detecting the pathogen itself or the immune response triggered by infection (
Figure 3). In contrast, AINU employs a fundamentally different paradigm by focusing on early-stage, subtle morphological alterations induced within host cell nuclei following viral infection. The core technology of AINU leverages advanced deep learning models to analyze high-dimensional nuclear image data acquired through super-resolution microscopy. This analysis enables the detection of infection-induced abnormalities in key nuclear components, such as alterations in histone modification patterns and spatial distribution and clustering status of RNA polymerase II. This approach demonstrates high sensitivity in identifying infected cells during the very early stages of active viral replication, prior to the emergence of significant cytopathic effects or robust immune responses. By integrating explainable artificial intelligence (XAI) techniques, AINU has identified key nuclear features that discriminate between healthy and infected cell states. This capability provides a novel analytical framework for understanding cellular heterogeneity and elucidating the mechanisms of viral infection [
42] (
Figure 4). This approach positions AINU as a potential tool for ultra-early diagnosis, operating in a timeframe prior to the accumulation of sufficient pathogen material for conventional detection or the onset of a detectable immune response. Therefore, it complements rather than replaces established methods. While AINU represents a highly specialized and emerging niche, its inclusion here underscores the diversity of AI applications moving beyond traditional paradigms toward ultra-early, morphology-based cellular diagnostics, potentially offering a complementary approach to established methods in future diagnostic workflows [
43].
4. AI-Enhanced Point-of-Care Testing (POCT)
Syndromic surveillance and multiplex detection capabilities can enhance POCT, while the integration of AI further improves its efficiency, accuracy, and accessibility.
4.1. Intelligent Syndromic Surveillance
Driven by the growing demand for precision medicine, infectious disease diagnostics is evolving toward multiplexed, rapid, and intelligent approaches. Within this context, the application of machine learning (ML) in intelligent syndromic detection, that is the simultaneous analysis of multiple biomarkers which has significantly enhanced the accuracy and efficiency of diagnosing infectious diseases. ML technologies have been integrated into various POCT platforms, including lateral flow immunoassays (LFA), vertical flow immunoassays (VFA), nucleic acid amplification tests (NAAT), and imaging sensors, thereby achieving higher sensitivity, specificity, and automation in the detection of a range of infectious diseases such as COVID-19, HIV, malaria, and Lyme disease [
44]. For instance, an intelligent detection system based on temporal deep learning architectures (e.g., TIMESAVER algorithm) integrates YOLO, CNN-LSTM, and fully connected networks to enable real-time processing of time-series images from LFA strips. This system completes detection of COVID-19 antigens and influenza A/B viruses within 1–2 min with accuracy exceeding 96%, outperforming 15 min manual interpretation while demonstrating strong versatility across multiple commercial test strips. Notably, it significantly reduces false-negative risks in low viral load samples, making it particularly suitable for time-sensitive scenarios such as emergency departments and field screening [
45]. Another representative application is the Rapid Multiplex Molecular Syndromic Panel (RMMSP), which enables simultaneous detection of multiple targets including bacterial, viral, and fungal pathogens as well as antimicrobial resistance genes. Capable of providing results within six hours, this technology significantly accelerates the diagnostic process for critically ill patients, thereby facilitating early targeted antimicrobial therapy, optimizing antibiotic stewardship, and improving clinical outcomes [
46].
Such AI-enhanced syndromic detection platforms not only achieve high-throughput, multiplex pathogen detection but also represent a paradigm shift in molecular diagnostics, evolving from traditional pathogen identification tools into intelligent diagnostic assistance systems with clinical interpretation capabilities. The promising performance of AI-enhanced POCT must be balanced against practical constraints. In resource-limited settings, where such tools are most needed, factors like device cost, maintenance requirements, and operational complexity can impede sustainable adoption. Moreover, while AI can improve test interpretation, its effectiveness is ultimately bounded by the underlying analytical performance (e.g., sensitivity/specificity) of the immunoassay or NAAT platform itself. Continuous improvement of both the physical sensing technology and the AI algorithms is therefore indispensable for realizing the full potential of intelligent syndromic surveillance.
4.2. AI-Enhanced Diagnostic Devices
Amid the wave of digital transformation reshaping healthcare, AI is emerging as a key driving force behind a paradigm shift in infectious disease diagnostics. AI-enhanced diagnostic devices have demonstrated significant advantages in the detection and surveillance of infectious diseases, particularly by improving diagnostic speed, accuracy, and accessibility. Typically deployed on POCT and wearable sensing platforms, such devices leverage machine learning techniques, including CNNs and deep learning to achieve high-sensitivity and high-specificity detection of a wide range of infectious diseases [
47]. In resource-limited settings, AI-enhanced diagnostic devices have demonstrated remarkable success. For instance, in sub-Saharan Africa, portable CNN-based devices achieved 95% sensitivity in detecting malaria parasites. In India, AI-driven sputum analysis combined with telemedicine attained 92% sensitivity for tuberculosis diagnosis while reducing the diagnosis time from weeks to hours. During COVID-19 containment, AI systems enabled rapid triage and risk stratification with high accuracy (AUC-ROC: 89%), significantly shortening isolation periods and improving patient outcomes [
48].
AI-enhanced diagnostic technologies have significantly optimized infectious disease diagnosis and control outcomes through high-precision, rapid, and accessible testing capabilities, demonstrating particularly important practical value in resource-limited settings., although long-term sustainability depends on overcoming hurdles related to device durability, connectivity, and continuous user training.
5. Available Databases for AI-Driven Infectious Disease Diagnostics
Publicly accessible, high-quality databases serve as the cornerstone for advancing AI in infectious disease diagnostics. These resources, spanning genomics, medical imaging, clinical records, and multi-omics data, provide a rich and diverse foundation for training and validating AI models, significantly enhancing the accuracy and practical utility of diagnostic tools.
5.1. Genomic and Sequencing Databases
5.1.1. GISAID (Global Initiative on Sharing All Influenza Data)
Among the publicly available data platforms for training and validating AI models, the Global Initiative on Sharing All Influenza Data (GISAID) plays a pivotal role. GISAID is a critically curated database providing annotated genomic sequences of pathogens like influenza viruses and SARS-CoV-2, coupled with essential metadata such as sampling date and geographic location. This platform is indispensable for developing AI-driven diagnostic and surveillance tools; the large-scale, well-annotated data it provides fuels machine learning models for pathogen identification, classification, and phylogenetic analysis. For instance, leveraging GISAID data, AI algorithms enable dynamic lineage nomenclature, track the evolution of variants, and reconstruct transmission chains, significantly enhancing real-time insights in genomic epidemiology. Thus, GISAID serves as an integral bridge connecting AI technology with practical infectious disease diagnostics [
49,
50,
51,
52].
5.1.2. NCBI GenBank and SRA (Sequence Read Archive)
In AI-driven infectious disease diagnostics, public databases such as NCBI GenBank and the Sequence Read Archive (SRA) serve as essential resources. GenBank hosts a vast collection of annotated nucleotide sequences from diverse pathogens, while SRAs raw high-throughput sequencing data from a wide range of clinical and environmental samples. These repositories provide a foundational dataset for training and validating AI models, particularly in tasks such as rapid pathogen identification, resistance gene detection, and metagenomic classification. For instance, k-mer-based classification tools like Kraken leverage reference genomes from GenBank to enable efficient sequence alignment and taxonomic labeling. Meanwhile, the real-time sequencing data available in SRA support AI-powered outbreak investigation and variant surveillance. The accessibility of these datasets significantly advances the application of AI in genomic epidemiology, facilitating precise diagnostics and timely public health responses [
53,
54,
55].
5.1.3. PATRIC (Pathosystems Resource Integration Center)
In AI-driven infectious disease diagnostics, the availability of high-quality, structured databases is crucial for model training and validation. PATRIC (Pathosystems Resource Integration Center) serves as a comprehensive bacterial bioinformatics resource, integrating genomic, transcriptomic, and proteomic data from over 80,000 bacterial genomes, with a strong emphasis on antimicrobial resistance (AMR) metadata and standardized annotations. The platform not only supports comparative genomics but also incorporates machine learning-based AMR phenotype prediction tools, such as k-mer-based AdaBoost classifiers for species like Mycobacterium tuberculosis and Streptococcus pneumoniae, achieving high classification accuracy. By providing antibiotic susceptibility profiles, virulence factor annotations, and host–pathogen interaction data, PATRIC enables robust genotype-phenotype correlation studies, thereby enhancing the interpretability and clinical applicability of AI models in pathogen genomics and resistance prediction [
56,
57].
5.2. Medical Imaging Databases
During the COVID-19 pandemic, the establishment of public imaging datasets significantly advanced the application of AI in infectious disease diagnostics. For instance, several publicly available chest X-ray and CT image datasets (e.g., the COVID-19 Radiography Database) integrate images from patients with COVID-19, other viral pneumonias, and healthy controls, providing a crucial foundation for training and validating deep learning models. These datasets have supported tasks such as automated lesion segmentation, disease classification, and severity assessment, while also facilitating the validation of model generalizability in real-world clinical settings. As highlighted by Wynants et al. (BMJ, 2020) [
58] in their systematic review, although many early models suffered from limited sample sizes and high risk of bias, validation studies based on public datasets have offered a vital platform for objective performance evaluation. The open sharing of such data resources has been indispensable for advancing the practical application and standardization of AI-assisted diagnostic imaging—a key focus of this review [
58].
Malaria Cell Images Dataset: In the field of malaria diagnosis, publicly available cell image datasets play a crucial role in advancing AI model development. For instance, the malaria cell image dataset hosted by platforms such as the National Institutes of Health (NIH) contains thousands of thin blood smear images collected from hundreds of patients, each meticulously annotated by experts to distinguish between parasitized and uninfected red blood cells. This dataset serves as a benchmark for training and validating convolutional neural networks (CNNs), supporting the development of lightweight models like CFPNet-M tailored for resource-constrained settings. By providing standardized, multi-center image samples, such datasets significantly enhance model generalizability in challenging scenarios including staining variations and cell overlapping, thereby laying a solid data foundation for the clinical translation of AI in malaria microscopic image diagnosis [
59,
60].
TB Portals Program: In the field of malaria diagnosis, publicly available datasets are crucial for advancing AI model development. For instance, the Malaria Cell Images Dataset, hosted on platforms such as Kaggle and the NIH, contains thousands of thin blood smear images with clear labels indicating parasitized and uninfected red blood cells. This dataset has become a benchmark for training and validating CNN-based models, enabling high-accuracy automated detection of malaria parasites. Its standardization and accessibility significantly promote the application of AI in digital microscopy image analysis, providing a reproducible foundation for algorithm development—particularly relevant to AI-enhanced POCT and intelligent diagnostic imaging discussed in this review [
61].
5.3. Clinical and Syndromic Surveillance Databases
MIMIC-IV (Medical Information Mart for Intensive Care): The availability of high-quality clinical databases is crucial for the development and validation of AI models in infectious disease diagnostics. MIMIC-IV (Medical Information Mart for Intensive Care) is a large, freely accessible database comprising de-identified electronic health records from intensive care unit patients, which includes comprehensive clinical data such as laboratory results, vital signs, medication records, and diagnostic codes. This resource provides a robust foundation for building AI models aimed at early sepsis prediction, detection of nosocomial infections, and syndromic surveillance. Particularly when integrating multi-source information—such as high-throughput sequencing (HTS), medical imaging, and point-of-care testing (POCT)—MIMIC-IV enables model training and validation in real-world clinical settings, thereby enhancing the generalizability and practical utility of AI-driven diagnostic tools for complex infectious diseases [
62,
63].
OpenFDA: In terms of publicly available databases, the U.S. Food and Drug Administration’s open data platform, OpenFDA (
https://open.fda.gov, accessed on 16 June 2025), offers valuable resources for infectious disease diagnostics. It provides harmonized access via application programming interfaces (APIs) to structured datasets including adverse event reports (FAERS), product labeling (SPL), recall information (RES), and medical device reports (MAUDE). For infectious disease applications, researchers can leverage OpenFDA to access post-market surveillance data related to antiviral drugs, vaccines, or diagnostic devices. AI models can utilize these data for pharmacovigilance tasks—such as mining adverse event signals—assessing diagnostic device performance, or analyzing usage trends during outbreaks. These real-world data enhance the external validation of AI tools and support integrated surveillance of diagnostics and therapeutics in infectious disease management [
64].
5.4. Multi-Omics and Integrated Databases
In the context of multi-omics data integration for infectious diseases, the Virus Pathogen Database and Analysis Resource serves as a prominent example of a comprehensive platform. ViPR aggregates multi-dimensional viral data—including genomic, proteomic, and immune epitope information—and offers a suite of integrated analysis tools. It particularly supports AI-driven investigations into viral host adaptation, antibody escape prediction, and vaccine design, providing a well-structured and annotated data foundation for in-depth analysis of high-throughput sequencing data. By facilitating combined multi-omics analyses of viral pathogens, ViPR stands as a key public resource that enhances AI-assisted diagnostics and research into pathogen evolution [
65,
66].
The European Nucleotide Archive (ENA): The European Nucleotide Archive (ENA), a key member of the International Nucleotide Sequence Database Collaboration (INSDC), serves as a vital repository providing high-quality public data for infectious disease diagnostics. ENA integrates large-scale sequencing data from clinical and environmental samples, encompassing raw reads, assembled genomes, and rich spatiotemporal metadata. In AI-driven diagnostic research, ENA supplies essential datasets for pathogen detection, variant tracking, and outbreak investigation. During the COVID-19 pandemic, for instance, ENA’s efficient submission and distribution APIs enabled real-time global sharing and analysis of SARS-CoV-2 genomic data, crucially supporting AI model training and outbreak response. Recent enhancements such as the Pathogens Portal and a tagging system further improve data discoverability and facilitate AI-based identification of pathogenic organisms within complex microbial communities [
67,
68,
69].
These databases represent valuable resources for training, validating, and benchmarking AI models in infectious disease diagnostics. However, challenges remain regarding data standardization, interoperability, and ethical sharing practices. Future efforts should focus on curating multi-institutional, multimodal datasets that are representative of global populations to enhance the generalizability and equity of AI diagnostic tools [
70,
71,
72,
73,
74,
75].
6. Challenges and Limitations in AI Diagnostic Technologies
Despite demonstrating substantial potential and achieving remarkable successes in the field of diagnostic assistance, AI continues to face formidable challenges on its path toward large-scale clinical implementation and routine application. Addressing these limitations is critical for realizing the full potential of AI in clinical practice.
6.1. Economic and Administrative Burden
First, the adoption of AI in medical fields imposes significant economic and administrative burdens on healthcare institutions. To ensure patient safety, medical AI systems must undergo rigorous regulatory approval processes demonstrating safety, efficacy, and transparency before clinical deployment. This process is often protracted and cost-prohibitive, involving large-scale clinical trials and complex regulatory documentation, thereby imposing substantial financial strain on both development institutions and healthcare facilities [
76]. Beyond initial deployment, AI systems require continuous investment, including iterative data integration, software updates, and hardware maintenance. These systems demand specialized personnel in medical informatics, data science, and IT support for ongoing system maintenance, updates, and user training, constituting persistent economic and administrative burdens for healthcare institutions [
77,
78].
6.2. Regulatory and Clinical Integration Barriers
Moreover, the regulatory landscape for medical AI is rapidly evolving worldwide, and stringent regulations impede the progression from development and validation to full integration into healthcare systems; this poses significant challenges for medical institutions [
79,
80,
81,
82]. Furthermore, AI tools face linguistic and cultural application constraints. When deployed in diverse linguistic environments or among populations with unique cultural health beliefs, their performance may significantly decline, necessitating localization adjustments that require additional resources and time investment [
83,
84]. Additionally, a critical challenge lies in effectively communicating the complex numerical outputs of AI models to physicians in an accurate, intuitive, and clinically meaningful manner, while seamlessly integrating them into clinical decision-making processes. Establishing efficient human–AI interaction interfaces is imperative to ensure that AI-generated recommendations are correctly interpreted and appropriately adopted in practice [
85].
6.3. Data Bottlenecks and Model Limitations
In complex scenarios such as medical imaging, AI must rely on vast amounts of data for deep training and iterative algorithm optimization to achieve accurate and reliable analysis [
86]. However, large-scale, well-annotated datasets remain scarce and costly to generate. Discrepancies in diagnostic criteria across regions and stringent patient privacy protection requirements exacerbate difficulties in data integration and sharing, ultimately severely impeding the deep application and widespread adoption of AI in medical imaging [
87].
Annotation scarcity and the difficulty of generating large-scale, high-quality labeled datasets remain among the most critical barriers to clinical AI implementation. To address this, the research community is actively developing innovative strategies. Self-supervised learning (SSL) offers a promising avenue by pre-training models on vast amounts of unlabeled data (e.g., raw medical images or genomic sequences) to learn meaningful representations before fine-tuning on limited labeled datasets. For instance, models pre-trained with SSL on large chest CT archives have shown improved performance in downstream tasks like COVID-19 classification with minimal labeled examples [
88,
89]. Semi-supervised learning and active learning frameworks can strategically select the most informative samples for expert annotation, maximizing model performance gains while minimizing labeling effort [
90,
91,
92]. Furthermore, synthetic data generation techniques, particularly generative adversarial networks (GANs), can create realistic, annotated medical data to augment small training sets, thereby improving model robustness and helping to address class imbalance [
93,
94]. However, ensuring the clinical fidelity and avoiding bias propagation in synthetic data remain challenges. Finally, domain adaptation and generalization methods are crucial for enhancing model robustness across different hospitals, scanner protocols, and patient populations. These techniques aim to learn invariant features that perform well on data distributions different from the training set, which is essential for real-world deployment [
95,
96].
While these approaches show considerable promise, their integration into standardized diagnostic workflows and regulatory acceptance are ongoing endeavors. Future work should focus on benchmarking these methods in large-scale, multi-center studies specific to infectious disease diagnostics to validate their practical utility.
7. Summary and Future Directions
7.1. Summary
This review systematically examines the rapid advancement of AI in infectious disease diagnostics and its multifaceted clinical utility. In the domain of high-throughput sequencing, AI has demonstrated transformative capabilities in pathogen identification, characterization, and real-time surveillance of outbreaks and transmission patterns. In the realm of image analysis, AI enables automated detection and quantification of pathogens in digital microscopy images while enhancing diagnostic accuracy for various infectious diseases through radiological imaging. For point-of-care testing, AI augments syndromic surveillance platforms and facilitates the development of intelligent, field-deployable diagnostic devices.
Despite these advancements, significant challenges persist in data availability, model generalizability, economic sustainability, and clinical integration. Addressing these limitations requires collaborative efforts among clinicians, microbiologists, computational scientists, and policymakers. Future directions should focus on developing more robust and adaptable AI systems, establishing standardized datasets and evaluation frameworks, and creating streamlined regulatory pathways for AI-based infectious disease diagnostics. Through this multidisciplinary approach, AI can realize its full potential to revolutionize infectious disease diagnosis and management, ultimately enhancing global health security.
7.2. Future Directions
Despite the promising advances outlined in this review, the full integration of AI into routine infectious disease diagnostics requires addressing several persistent challenges. Future research and development efforts should therefore prioritize a number of key directions to bridge the gap between technological potential and clinical utility.
First, overcoming data-related bottlenecks is paramount. Efforts should focus on the development and adoption of standardized, large-scale, and well-annotated multimodal datasets that are representative of diverse populations and settings. To mitigate the high cost of manual annotation, exploring advanced techniques such as self-supervised learning, federated learning for privacy-preserving collaborative model training, and synthetic data generation holds significant promise for enhancing model robustness and generalizability.
Second, the next generation of AI models must be designed with clinical translation in mind. This involves creating more interpretable and trustworthy systems through the integration of explainable AI (XAI) principles, enabling clinicians to understand the rationale behind AI-generated insights. Furthermore, developing lightweight and computationally efficient algorithms will be critical for deploying AI models on mobile or point-of-care devices in resource-limited environments.
Finally, successful clinical integration will depend on establishing streamlined regulatory pathways and fostering effective human-AI collaboration. Prospective, multi-center clinical validation studies are urgently needed to generate robust evidence of efficacy and cost-effectiveness. Simultaneously, research should focus on designing intuitive user interfaces and clinical workflows that seamlessly integrate AI outputs into decision-making processes, ultimately supporting rather than replacing clinical expertise.
By addressing these priorities through collaborative efforts among clinicians, researchers, regulatory bodies, and industry partners, AI can evolve from a powerful analytical tool into a dependable ally in the global fight against infectious diseases.
Author Contributions
C.D.: writing—original draft preparation, searching literature, formal analysis, writing—review and editing; Y.L.: writing—review and editing, graph drawing; J.N. and X.Z.: formal analysis and manuscript revision; F.Y. and Y.Z.: conceptualization, writing—review and editing. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
No new data were created or analyzed in this study. Data sharing is not applicable to this article.
Conflicts of Interest
Author Yongfei Zhou was employed by the company Beijing Hemu Biotechnology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
AI | Artificial Intelligence |
AINU | Artificial Intelligence of the Nucleus |
CBCT | Cone-Beam Computed Tomography |
ML | Machine Learning |
HTS | High-Throughput Sequencing |
POCT | Point-of-Care Testing |
SVM | Support Vector Machines |
HMM | Hidden Markov Models |
SNVs | Single-Nucleotide Variations |
mAP | Mean Average Precision |
ARDS | Acute Respiratory Distress Syndrome |
CXR | Chest X-ray |
XAI | Explainable Artificial Intelligence |
LFA | Lateral Flow immunoassays |
VFA | Vertical Flow immunoassays |
NAAT | Nucleic Acid Amplification Tests |
CNN | Convolutional Neural Network(s) |
BiLSTM | Bidirectional Long Short-Term Memory |
GANs | Generative Adversarial Networks |
SSL | Self-Supervised Learning |
RMMSP | Rapid Multiplex Molecular Syndromic Panel |
References
- Topol, E.J. High-performance medicine: The convergence of human and artificial intelligence. Nat. Med. 2019, 25, 44–56. [Google Scholar] [CrossRef]
- Khattak, M.; Kierkegaard, P.; McGregor, A.; Perry, D.C. Bridging innovation to implementation in artificial intelligence fracture detection: A commentary piece. Bone Jt. J. 2025, 107-B, 582–586. [Google Scholar] [CrossRef]
- He, J.; Baxter, S.L.; Xu, J.; Xu, J.; Zhou, X.; Zhang, K. The practical implementation of artificial intelligence technologies in medicine. Nat. Med. 2019, 25, 30–36. [Google Scholar] [CrossRef]
- Kraemer, M.U.G.; Tsui, J.L.; Chang, S.Y.; Lytras, S.; Khurana, M.P.; Vanderslott, S.; Bajaj, S.; Scheidwasser, N.; Curran-Sebastian, J.L.; Semenova, E.; et al. Artificial intelligence for modelling infectious disease epidemics. Nature 2025, 638, 623–635. [Google Scholar] [CrossRef]
- Chu, W.T.; Reza, S.M.S.; Anibal, J.T.; Landa, A.; Crozier, I.; Bağci, U.; Wood, B.J.; Solomon, J. Artificial Intelligence and Infectious Disease Imaging. J. Infect. Dis. 2023, 228, S322–S336. [Google Scholar] [CrossRef] [PubMed]
- Cesaro, A.; Hoffman, S.C.; Das, P.; de la Fuente-Nunez, C. Challenges and applications of artificial intelligence in infectious diseases and antimicrobial resistance. npj Antimicrob. Resist. 2025, 3, 2. [Google Scholar] [CrossRef] [PubMed]
- Srivastava, V.; Kumar, R.; Wani, M.Y.; Robinson, K.; Ahmad, A. Role of artificial intelligence in early diagnosis and treatment of infectious diseases. Infect. Dis. 2025, 57, 1–26. [Google Scholar] [CrossRef]
- Bonazzetti, C.; Rocchi, E.; Toschi, A.; Derus, N.R.; Sala, C.; Pascale, R.; Rinaldi, M.; Campoli, C.; Pasquini, Z.A.I.; Tazza, B.; et al. Artificial Intelligence model to predict resistances in Gram-negative bloodstream infections. npj Digit. Med. 2025, 8, 319. [Google Scholar] [CrossRef] [PubMed]
- Shafieioun, A.; Ghaffari, H.; Baradaran, M.; Rigi, A.; Shahir Eftekhar, M.; Shojaeshafiei, F.; Korani, M.A.; Hatami, B.; Shirdel, S.; Ghanbari, K.; et al. Predictive power of artificial intelligence for malignant cerebral edema in stroke patients: A CT-based systematic review and meta-analysis of prevalence and diagnostic performance. Neurosurg. Rev. 2025, 48, 318. [Google Scholar] [CrossRef]
- Garcia-Garcia, S.; Perez-Arguello, A.; Henares, D.; Timoneda, N.; Muñoz-Almagro, C. Rapid identification, capsular typing and molecular characterization of Streptococcus pneumoniae by using whole genome nanopore sequencing. BMC Microbiol. 2020, 20, 347. [Google Scholar] [CrossRef]
- Hosny, A.; Parmar, C.; Quackenbush, J.; Schwartz, L.H.; Aerts, H. Artificial intelligence in radiology. Nat. Rev. Cancer 2018, 18, 500–510. [Google Scholar] [CrossRef]
- Manickam, P.; Mariappan, S.A.; Murugesan, S.M.; Hansda, S.; Kaushik, A.; Shinde, R.; Thipperudraswamy, S.P. Artificial Intelligence (AI) and Internet of Medical Things (IoMT) Assisted Biomedical Systems for Intelligent Healthcare. Biosensors 2022, 12, 562. [Google Scholar] [CrossRef]
- Mobadersany, P.; Yousefi, S.; Amgad, M.; Gutman, D.A.; Barnholtz-Sloan, J.S.; Velázquez Vega, J.E.; Brat, D.J.; Cooper, L.A.D. Predicting cancer outcomes from histology and genomics using convolutional networks. Proc. Natl. Acad. Sci. USA 2018, 115, E2970–E2979. [Google Scholar] [CrossRef]
- Cui, Z.X.; Liu, C.; Fan, X.; Cao, C.; Cheng, J.; Zhu, Q.; Liu, Y.; Jia, S.; Wang, H.; Zhu, Y.; et al. Physics-Informed DeepMRI: K-Space Interpolation Meets Heat Diffusion. IEEE Trans. Med. Imaging 2024, 43, 3503–3520. [Google Scholar] [CrossRef]
- Lalmuanawma, S.; Hussain, J.; Chhakchhuak, L. Applications of machine learning and artificial intelligence for COVID-19 (SARS-CoV-2) pandemic: A review. Chaos Solitons Fractals 2020, 139, 110059. [Google Scholar] [CrossRef]
- Gordon-Rodriguez, E.; Quinn, T.P.; Cunningham, J.P. Learning sparse log-ratios for high-throughput sequencing data. Bioinformatics 2021, 38, 157–163. [Google Scholar] [CrossRef]
- Ning, H.; Boyes, I.; Numanagić, I.; Rott, M.; Xing, L.; Zhang, X. Diagnostics of viral infections using high-throughput genome sequencing data. Brief. Bioinform. 2024, 25, bbae501. [Google Scholar] [CrossRef]
- Christians, F.C.; Akhund-Zade, J.; Jarman, K.; Venkatasubrahmanyam, S.; Noll, N.; Blauwkamp, T.A.; Bercovici, S.; Zielinska, A.; Carr, A.L.; Craney, A.; et al. Analytical and clinical validation of direct detection of antimicrobial resistance markers by plasma microbial cell-free DNA sequencing. J. Clin. Microbiol. 2024, 62, e0042524. [Google Scholar] [CrossRef] [PubMed]
- Lin, T.Y.; Ming-Fang Yen, A.; Li-Sheng Chen, S.; Hsu, C.Y.; Yeh, Y.P.; Hsiu-Hsi Chen, T. Artificial intelligence for precision viral surveillance of emerging infectious disease (EID): Data-driven digital twin metaverse-envisioned study. Comput. Biol. Med. 2025, 196, 110877. [Google Scholar] [CrossRef] [PubMed]
- Xu, C.; Zhao, L.Y.; Ye, C.S.; Xu, K.C.; Xu, K.Y. The application of machine learning in clinical microbiology and infectious diseases. Front. Cell. Infect. Microbiol. 2025, 15, 1545646. [Google Scholar] [CrossRef] [PubMed]
- Arango-Argoty, G.; Garner, E.; Pruden, A.; Heath, L.S.; Vikesland, P.; Zhang, L. DeepARG: A deep learning approach for predicting antibiotic resistance genes from metagenomic data. Microbiome 2018, 6, 23. [Google Scholar] [CrossRef]
- Tao, C.; Du, J.; Tang, Y.; Wang, J.; Dong, K.; Yang, M.; Hu, B.; Zhang, Z. A Deep-Learning Based System for Rapid Genus Identification of Pathogens under Hyperspectral Microscopic Images. Cells 2022, 11, 2237. [Google Scholar] [CrossRef]
- Grubaugh, N.D.; Gangavarapu, K.; Quick, J.; Matteson, N.L.; De Jesus, J.G.; Main, B.J.; Tan, A.L.; Paul, L.M.; Brackney, D.E.; Grewal, S.; et al. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar. Genome Biol. 2019, 20, 8. [Google Scholar] [CrossRef]
- Cheah, B.C.J.; Vicente, C.R.; Chan, K.R. Machine Learning and Artificial Intelligence for Infectious Disease Surveillance, Diagnosis, and Prognosis. Viruses 2025, 17, 882. [Google Scholar] [CrossRef]
- Chen, J.; Yuan, D.; Dong, R.; Cai, J.; Ai, Z.; Zhou, S. Artificial intelligence significantly facilitates development in the mental health of college students: A bibliometric analysis. Front. Psychol. 2024, 15, 1375294. [Google Scholar] [CrossRef] [PubMed]
- Quick, J.; Loman, N.J.; Duraffour, S.; Simpson, J.T.; Severi, E.; Cowley, L.; Bore, J.A.; Koundouno, R.; Dudas, G.; Mikhail, A.; et al. Real-time, portable genome sequencing for Ebola surveillance. Nature 2016, 530, 228–232. [Google Scholar] [CrossRef] [PubMed]
- Whata, A.; Chimedza, C. Deep Learning for SARS COV-2 Genome Sequences. IEEE Access 2021, 9, 59597–59611. [Google Scholar] [CrossRef] [PubMed]
- The COVID-19 Genomics UK (COG-UK) Consortium. An integrated national scale SARS-CoV-2 genomic surveillance network. Lancet Microbe 2020, 1, e99–e100. [Google Scholar] [CrossRef] [PubMed]
- Xie, R.; Adam, D.C.; Hu, S.; Cowling, B.J.; Gascuel, O.; Zhukova, A.; Dhanasekaran, V. Integrating Contact Tracing Data to Enhance Outbreak Phylodynamic Inference: A Deep Learning Approach. Mol. Biol. Evol. 2024, 41, msae232. [Google Scholar] [CrossRef]
- Telenti, A.; Hodcroft, E.B.; Robertson, D.L. The Evolution and Biology of SARS-CoV-2 Variants. Cold Spring Harb. Perspect. Med. 2022, 12, a041390. [Google Scholar] [CrossRef]
- Maturana, C.R.; de Oliveira, A.D.; Nadal, S.; Bilalli, B.; Serrat, F.Z.; Soley, M.E.; Igual, E.S.; Bosch, M.; Lluch, A.V.; Abelló, A.; et al. Advances and challenges in automated malaria diagnosis using digital microscopy imaging with artificial intelligence tools: A review. Front. Microbiol. 2022, 13, 1006659. [Google Scholar] [CrossRef]
- Sukumarran, D.; Hasikin, K.; Mohd Khairuddin, A.S.; Ngui, R.; Wan Sulaiman, W.Y.; Vythilingam, I.; Divis, P.C.S. An automated malaria cells detection from thin blood smear images using deep learning. Trop. Biomed. 2023, 40, 208–219. [Google Scholar] [CrossRef]
- Al Kuwaiti, A.; Nazer, K.; Al-Reedy, A.; Al-Shehri, S.; Al-Muhanna, A.; Subbarayalu, A.V.; Al Muhanna, D.; Al-Muhanna, F.A. A Review of the Role of Artificial Intelligence in Healthcare. J. Pers. Med. 2023, 13, 951. [Google Scholar] [CrossRef] [PubMed]
- Alsharif, W.; Qurashi, A. Effectiveness of COVID-19 diagnosis and management tools: A review. Radiography 2021, 27, 682–687. [Google Scholar] [CrossRef] [PubMed]
- Vogele, D.; Otto, S.; Sollmann, N.; Haggenmüller, B.; Wolf, D.; Beer, M.; Schmidt, S.A. Sarcopenia—Definition, Radiological Diagnosis, Clinical Significance. RoFo Fortschritte Auf Dem Geb. Der Rontgenstrahlen Und Der Nukl. 2023, 195, 393–405. [Google Scholar] [CrossRef]
- Suri, J.S.; Agarwal, S.; Gupta, S.K.; Puvvula, A.; Biswas, M.; Saba, L.; Bit, A.; Tandel, G.S.; Agarwal, M.; Patrick, A.; et al. A narrative review on characterization of acute respiratory distress syndrome in COVID-19-infected lungs using artificial intelligence. Comput. Biol. Med. 2021, 130, 104210. [Google Scholar] [CrossRef] [PubMed]
- Shi, F.; Wang, J.; Shi, J.; Wu, Z.; Wang, Q.; Tang, Z.; He, K.; Shi, Y.; Shen, D. Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation, and Diagnosis for COVID-19. IEEE Rev. Biomed. Eng. 2021, 14, 4–15. [Google Scholar] [CrossRef]
- Bouchareb, Y.; Moradi Khaniabadi, P.; Al Kindi, F.; Al Dhuhli, H.; Shiri, I.; Zaidi, H.; Rahmim, A. Artificial intelligence-driven assessment of radiological images for COVID-19. Comput. Biol. Med. 2021, 136, 104665. [Google Scholar] [CrossRef]
- Song, Y.; Zheng, S.; Li, L.; Zhang, X.; Zhang, X.; Huang, Z.; Chen, J.; Wang, R.; Zhao, H.; Chong, Y.; et al. Deep Learning Enables Accurate Diagnosis of Novel Coronavirus (COVID-19) with CT Images. IEEE/ACM Trans. Comput. Biol. Bioinform. 2021, 18, 2775–2780. [Google Scholar] [CrossRef]
- Zhang, K.; Liu, X.; Shen, J.; Li, Z.; Sang, Y.; Wu, X.; Zha, Y.; Liang, W.; Wang, C.; Wang, K.; et al. Clinically Applicable AI System for Accurate Diagnosis, Quantitative Measurements, and Prognosis of COVID-19 Pneumonia Using Computed Tomography. Cell 2020, 181, 1423–1433.e1411. [Google Scholar] [CrossRef]
- Kuo, M.D.; Chiu, K.W.H.; Wang, D.S.; Larici, A.R.; Poplavskiy, D.; Valentini, A.; Napoli, A.; Borghesi, A.; Ligabue, G.; Fang, X.H.B.; et al. Multi-center validation of an artificial intelligence system for detection of COVID-19 on chest radiographs in symptomatic patients. Eur. Radiol. 2023, 33, 23–33. [Google Scholar] [CrossRef]
- Carnevali, D.; Zhong, L.; González-Almela, E.; Viana, C.; Rotkevich, M.; Wang, A.; Franco-Barranco, D.; Gonzalez-Marfil, A.; Neguembor, M.V.; Castells-Garcia, A.; et al. A deep learning method that identifies cellular heterogeneity using nanoscale nuclear features. Nat. Mach. Intell. 2024, 6, 1021–1033. [Google Scholar] [CrossRef]
- Baxi, V.; Edwards, R.; Montalto, M.; Saha, S. Digital pathology and artificial intelligence in translational medicine and clinical practice. Mod. Pathol. 2022, 35, 23–32. [Google Scholar] [CrossRef] [PubMed]
- Han, G.R.; Goncharov, A.; Eryilmaz, M.; Ye, S.; Palanisamy, B.; Ghosh, R.; Lisi, F.; Rogers, E.; Guzman, D.; Yigci, D.; et al. Machine learning in point-of-care testing: Innovations, challenges, and opportunities. Nat. Commun. 2025, 16, 3165. [Google Scholar] [CrossRef]
- Lee, S.; Park, J.S.; Woo, H.; Yoo, Y.K.; Lee, D.; Chung, S.; Yoon, D.S.; Lee, K.B.; Lee, J.H. Rapid deep learning-assisted predictive diagnostics for point-of-care testing. Nat. Commun. 2024, 15, 1695. [Google Scholar] [CrossRef]
- Candel, F.J.; Salavert, M.; Cantón, R.; Del Pozo, J.L.; Galán-Sánchez, F.; Navarro, D.; Rodríguez, A.; Rodríguez, J.C.; Rodríguez-Aguirregabiria, M.; Suberviola, B.; et al. The role of rapid multiplex molecular syndromic panels in the clinical management of infections in critically ill patients: An experts-opinion document. Crit. Care 2024, 28, 440. [Google Scholar] [CrossRef]
- Khan, A.I.; Khan, M.; Khan, R. Artificial Intelligence in Point-of-Care Testing. Ann. Lab. Med. 2023, 43, 401–407. [Google Scholar] [CrossRef] [PubMed]
- Yammouri, G.; Ait Lahcen, A. AI-Reinforced Wearable Sensors and Intelligent Point-of-Care Tests. J. Pers. Med. 2024, 14, 1088. [Google Scholar] [CrossRef]
- Shu, Y.; McCauley, J. GISAID: Global initiative on sharing all influenza data—From vision to reality. Eurosurveillance 2017, 22, 30494. [Google Scholar] [CrossRef] [PubMed]
- Hadfield, J.; Megill, C.; Bell, S.M.; Huddleston, J.; Potter, B.; Callender, C.; Sagulenko, P.; Bedford, T.; Neher, R.A. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 2018, 34, 4121–4123. [Google Scholar] [CrossRef]
- Rambaut, A.; Holmes, E.C.; O’Toole, Á.; Hill, V.; McCrone, J.T.; Ruis, C.; du Plessis, L.; Pybus, O.G. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. 2020, 5, 1403–1407. [Google Scholar] [CrossRef]
- Viana, R.; Moyo, S.; Amoako, D.G.; Tegally, H.; Scheepers, C.; Althaus, C.L.; Anyaneji, U.J.; Bester, P.A.; Boni, M.F.; Chand, M.; et al. Rapid epidemic expansion of the SARS-CoV-2 Omicron variant in southern Africa. Nature 2022, 603, 679–686. [Google Scholar] [CrossRef]
- Wood, D.E.; Salzberg, S.L. Kraken: Ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014, 15, R46. [Google Scholar] [CrossRef]
- Leinonen, R.; Sugawara, H.; Shumway, M. The sequence read archive. Nucleic Acids Res. 2011, 39, D19–D21. [Google Scholar] [CrossRef]
- Alcock, B.P.; Raphenya, A.R.; Lau, T.T.Y.; Tsang, K.K.; Bouchard, M.; Edalatmand, A.; Huynh, W.; Nguyen, A.V.; Cheng, A.A.; Liu, S.; et al. CARD 2020: Antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2020, 48, D517–D525. [Google Scholar] [CrossRef]
- Wattam, A.R.; Davis, J.J.; Assaf, R.; Boisvert, S.; Brettin, T.; Bun, C.; Conrad, N.; Dietrich, E.M.; Disz, T.; Gabbard, J.L.; et al. Improvements to PATRIC, the all-bacterial Bioinformatics Database and Analysis Resource Center. Nucleic Acids Res. 2017, 45, D535–D542. [Google Scholar] [CrossRef]
- Davis, J.J.; Boisvert, S.; Brettin, T.; Kenyon, R.W.; Mao, C.; Olson, R.; Overbeek, R.; Santerre, J.; Shukla, M.; Wattam, A.R.; et al. Antimicrobial Resistance Prediction in PATRIC and RAST. Sci. Rep. 2016, 6, 27930. [Google Scholar] [CrossRef] [PubMed]
- Wynants, L.; Van Calster, B.; Collins, G.S.; Riley, R.D.; Heinze, G.; Schuit, E.; Bonten, M.M.J.; Dahly, D.L.; Damen, J.A.A.; Debray, T.P.A.; et al. Prediction models for diagnosis and prognosis of COVID-19: Systematic review and critical appraisal. BMJ 2020, 369, m1328. [Google Scholar] [CrossRef] [PubMed]
- Rajaraman, S.; Antani, S.K.; Poostchi, M.; Silamut, K.; Hossain, M.A.; Maude, R.J.; Jaeger, S.; Thoma, G.R. Pre-trained convolutional neural networks as feature extractors toward improved malaria parasite detection in thin blood smear images. PeerJ 2018, 6, e4568. [Google Scholar] [CrossRef] [PubMed]
- Ufuktepe, D.K.; Yang, F.; Kassim, Y.M.; Yu, H.; Maude, R.J.; Palaniappan, K.; Jaeger, S. Deep Learning-Based Cell Detection and Extraction in Thin Blood Smears for Malaria Diagnosis. In Proceedings of the 2021 IEEE Applied Imagery Pattern Recognition Workshop, Washington, DC, USA, 12–14 October 2021; p. 9762109. [Google Scholar] [CrossRef]
- Rosenthal, A.; Gabrielian, A.; Engle, E.; Hurt, D.E.; Alexandru, S.; Crudu, V.; Sergueev, E.; Kirichenko, V.; Lapitskii, V.; Snezhko, E.; et al. The TB Portals: An Open-Access, Web-Based Platform for Global Drug-Resistant-Tuberculosis Data Sharing and Analysis. J. Clin. Microbiol. 2017, 55, 3267–3282. [Google Scholar] [CrossRef]
- Johnson, A.E.W.; Bulgarelli, L.; Shen, L.; Gayles, A.; Shammout, A.; Horng, S.; Pollard, T.J.; Hao, S.; Moody, B.; Gow, B.; et al. MIMIC-IV, a freely accessible electronic health record dataset. Sci. Data 2023, 10, 1. [Google Scholar] [CrossRef]
- Reyna, M.A.; Josef, C.S.; Jeter, R.; Shashikumar, S.P.; Westover, M.B.; Nemati, S.; Clifford, G.D.; Sharma, A. Early Prediction of Sepsis From Clinical Data: The PhysioNet/Computing in Cardiology Challenge 2019. Crit. Care Med. 2020, 48, 210–217. [Google Scholar] [CrossRef]
- Kass-Hout, T.A.; Xu, Z.; Mohebbi, M.; Nelsen, H.; Baker, A.; Levine, J.; Johanson, E.; Bright, R.A. OpenFDA: An innovative platform providing access to a wealth of FDA’s publicly available data. J. Am. Med. Inform. Assoc. JAMIA 2016, 23, 596–600. [Google Scholar] [CrossRef] [PubMed]
- Pickett, B.E.; Sadat, E.L.; Zhang, Y.; Noronha, J.M.; Squires, R.B.; Hunt, V.; Liu, M.; Kumar, S.; Zaremba, S.; Gu, Z.; et al. ViPR: An open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 2012, 40, D593–D598. [Google Scholar] [CrossRef] [PubMed]
- Pickett, B.E.; Greer, D.S.; Zhang, Y.; Stewart, L.; Zhou, L.; Sun, G.; Gu, Z.; Kumar, S.; Zaremba, S.; Larsen, C.N.; et al. Virus pathogen database and analysis resource (ViPR): A comprehensive bioinformatics database and analysis resource for the coronavirus research community. Viruses 2012, 4, 3209–3226. [Google Scholar] [CrossRef]
- Leinonen, R.; Akhtar, R.; Birney, E.; Bower, L.; Cerdeno-Tárraga, A.; Cheng, Y.; Cleland, I.; Faruque, N.; Goodgame, N.; Gibson, R.; et al. The European Nucleotide Archive. Nucleic Acids Res. 2011, 39, D28–D31. [Google Scholar] [CrossRef]
- Yuan, D.; Ahamed, A.; Burgin, J.; Cummins, C.; Devraj, R.; Gueye, K.; Gupta, D.; Gupta, V.; Haseeb, M.; Ihsan, M.; et al. The European Nucleotide Archive in 2023. Nucleic Acids Res. 2024, 52, D92–D97. [Google Scholar] [CrossRef]
- Meyer, F.; Fritz, A.; Deng, Z.L.; Koslicki, D.; Lesker, T.R.; Gurevich, A.; Robertson, G.; Alser, M.; Antipov, D.; Beghini, F.; et al. Critical Assessment of Metagenome Interpretation: The second round of challenges. Nat. Methods 2022, 19, 429–440. [Google Scholar] [CrossRef]
- DeGrave, A.J.; Janizek, J.D.; Lee, S.I. AI for radiographic COVID-19 detection selects shortcuts over signal. medRxiv Prepr. Serv. Health Sci. 2020. [Google Scholar] [CrossRef]
- Seyyed-Kalantari, L.; Zhang, H.; McDermott, M.B.A.; Chen, I.Y.; Ghassemi, M. Underdiagnosis bias of artificial intelligence algorithms applied to chest radiographs in under-served patient populations. Nat. Med. 2021, 27, 2176–2182. [Google Scholar] [CrossRef]
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef]
- Price, W.N., II; Cohen, I.G. Privacy in the age of medical big data. Nat. Med. 2019, 25, 37–43. [Google Scholar] [CrossRef]
- Kickbusch, I.; Piselli, D.; Agrawal, A.; Balicer, R.; Banner, O.; Adelhardt, M.; Capobianco, E.; Fabian, C.; Singh Gill, A.; Lupton, D.; et al. The Lancet and Financial Times Commission on governing health futures 2030: Growing up in a digital world. Lancet 2021, 398, 1727–1776. [Google Scholar] [CrossRef] [PubMed]
- Obermeyer, Z.; Powers, B.; Vogeli, C.; Mullainathan, S. Dissecting racial bias in an algorithm used to manage the health of populations. Science 2019, 366, 447–453. [Google Scholar] [CrossRef] [PubMed]
- Yu, K.H.; Beam, A.L.; Kohane, I.S. Artificial intelligence in healthcare. Nat. Biomed. Eng. 2018, 2, 719–731. [Google Scholar] [CrossRef]
- Meijerink, L.M.; Dunias, Z.S.; Leeuwenberg, A.M.; de Hond, A.A.H.; Jenkins, D.A.; Martin, G.P.; Sperrin, M.; Peek, N.; Spijker, R.; Hooft, L.; et al. Updating methods for artificial intelligence-based clinical prediction models: A scoping review. J. Clin. Epidemiol. 2025, 178, 111636. [Google Scholar] [CrossRef]
- Kitamura, G.; Deible, C. Retraining an open-source pneumothorax detecting machine learning algorithm for improved performance to medical images. Clin. Imaging 2020, 61, 15–19. [Google Scholar] [CrossRef]
- Petrick, N.; Chen, W.; Delfino, J.G.; Gallas, B.D.; Kang, Y.; Krainak, D.; Sahiner, B.; Samala, R.K. Regulatory considerations for medical imaging AI/ML devices in the United States: Concepts and challenges. J. Med. Imaging 2023, 10, 051804. [Google Scholar] [CrossRef] [PubMed]
- Clark, P.; Kim, J.; Aphinyanaphongs, Y. Marketing and US Food and Drug Administration Clearance of Artificial Intelligence and Machine Learning Enabled Software in and as Medical Devices: A Systematic Review. JAMA Netw. Open 2023, 6, e2321792. [Google Scholar] [CrossRef]
- Muehlematter, U.J.; Bluethgen, C.; Vokinger, K.N. FDA-cleared artificial intelligence and machine learning-based medical devices and their 510(k) predicate networks. Lancet. Digit. Health 2023, 5, e618–e626. [Google Scholar] [CrossRef]
- Schulz, W.L.; Durant, T.J.S.; Krumholz, H.M. Validation and Regulation of Clinical Artificial Intelligence. Clin. Chem. 2019, 65, 1336–1337. [Google Scholar] [CrossRef]
- Zeng-Treitler, Q.; Kim, H.; Rosemblat, G.; Keselman, A. Can multilingual machine translation help make medical record content more comprehensible to patients? Stud. Health Technol. Inform. 2010, 160, 73–77. [Google Scholar]
- Starlander, M.; Bouillon, P.; Rayner, M.; Chatzichrisafis, N.; Hockey, B.A.; Isahara, H.; Kanzaki, K.; Nakao, Y.; Santaholma, M. Breaking the language barrier: Machine assisted diagnosis using the medical speech translator. Stud. Health Technol. Inform. 2005, 116, 811–816. [Google Scholar]
- Kapuria, S.; Minot, P.; Kapusta, A.; Ikoma, N.; Alambeigi, F. A Novel Dual Layer Cascade Reliability Framework for an Informed and Intuitive Clinician-AI Interaction In Diagnosis of Colorectal Cancer Polyps. IEEE J. Biomed. Health Inform. 2024, 28, 2326–2337. [Google Scholar] [CrossRef] [PubMed]
- Seetharam, K.; Min, J.K. Artificial Intelligence and Machine Learning in Cardiovascular Imaging. Methodist DeBakey Cardiovasc. J. 2020, 16, 263–271. [Google Scholar] [CrossRef]
- Liao, J.; Huang, L.; Qu, M.; Chen, B.; Wang, G. Artificial Intelligence in Coronary CT Angiography: Current Status and Future Prospects. Front. Cardiovasc. Med. 2022, 9, 896366. [Google Scholar] [CrossRef]
- Ter-Sarkisov, A. COVID-CT-Mask-Net: Prediction of COVID-19 from CT scans using regional features. Appl. Intell. 2022, 52, 9664–9675. [Google Scholar] [CrossRef] [PubMed]
- Zhou, Z.; Sodha, V.; Siddiquee, M.M.R.; Feng, R.; Tajbakhsh, N.; Gotway, M.B.; Liang, J. Models Genesis: Generic Autodidactic Models for 3D Medical Image Analysis. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention: MICCAI, Shenzhen, China, 13–17 October 2019; Volume 11767, pp. 384–393. [Google Scholar] [CrossRef]
- Hussain, M.A.; Mirikharaji, Z.; Momeny, M.; Marhamati, M.; Neshat, A.A.; Garbi, R.; Hamarneh, G. Active deep learning from a noisy teacher for semi-supervised 3D image segmentation: Application to COVID-19 pneumonia infection in CT. Comput. Med. Imaging Graph 2022, 102, 102127. [Google Scholar] [CrossRef]
- Wu, Y.; Ge, Z.; Zhang, D.; Xu, M.; Zhang, L.; Xia, Y.; Cai, J. Mutual consistency learning for semi-supervised medical image segmentation. Med. Image Anal. 2022, 81, 102530. [Google Scholar] [CrossRef] [PubMed]
- Jiang, Y.; Sui, X.; Ding, Y.; Xiao, W.; Zheng, Y.; Zhang, Y. A semi-supervised learning approach with consistency regularization for tumor histopathological images analysis. Front. Oncol. 2022, 12, 1044026. [Google Scholar] [CrossRef]
- Ahn, G.; Choi, B.S.; Ko, S.; Jo, C.; Han, H.S.; Lee, M.C.; Ro, D.H. High-resolution knee plain radiography image synthesis using style generative adversarial network adaptive discriminator augmentation. J. Orthop. Res. 2023, 41, 84–93. [Google Scholar] [CrossRef] [PubMed]
- Chen, Y.; Yang, X.H.; Wei, Z.; Heidari, A.A.; Zheng, N.; Li, Z.; Chen, H.; Hu, H.; Zhou, Q.; Guan, Q. Generative Adversarial Networks in Medical Image augmentation: A review. Comput. Biol. Med. 2022, 144, 105382. [Google Scholar] [CrossRef] [PubMed]
- Skandarani, Y.; Jodoin, P.M.; Lalande, A. GANs for Medical Image Synthesis: An Empirical Study. J. Imaging 2023, 9, 69. [Google Scholar] [CrossRef] [PubMed]
- Guan, H.; Liu, M. Domain Adaptation for Medical Image Analysis: A Survey. IEEE Trans. Bio-Med. Eng. 2022, 69, 1173–1185. [Google Scholar] [CrossRef]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).