Next Article in Journal
Phage Encapsulation and Delivery Technology: A Strategy for Treating Drug-Resistant Pathogenic Microorganisms
Previous Article in Journal
Cuproptosis Contributes to Cisplatin-Induced Nephrotoxicity: Insights into Thymol’s Potential Inhibitory and Protective Effects
Previous Article in Special Issue
Pharmacogenomics and Pharmacometabolomics in Precision Tramadol Prescribing for Enhanced Pain Management: Evidence from QBB and EMR Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Applications of Genome Sequencing in Infectious Diseases: From Pathogen Identification to Precision Medicine

by
Gulam Mustafa Hasan
1,
Taj Mohammad
2,
Anas Shamsi
3,*,
Sukhwinder Singh Sohal
4 and
Md. Imtaiyaz Hassan
2,*
1
Department of Basic Medical Science, College of Medicine, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia
2
Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi 110025, India
3
Centre of Medical and Bio-Allied Health Sciences Research, Ajman University, Ajman P.O. Box 346, United Arab Emirates
4
Respiratory Translational Research Group, Department of Laboratory Medicine, School of Health Sciences, College of Health and Medicine, University of Tasmania, The Shed Building, 80 Cimitiere Street, Launceston, TAS 7250, Australia
*
Authors to whom correspondence should be addressed.
Pharmaceuticals 2025, 18(11), 1687; https://doi.org/10.3390/ph18111687
Submission received: 22 September 2025 / Revised: 29 October 2025 / Accepted: 6 November 2025 / Published: 7 November 2025
(This article belongs to the Special Issue Pharmacogenomics for Precision Medicine)

Abstract

Background: Genome sequencing is transforming infectious-disease diagnostics, surveillance, and precision therapy by enabling rapid, high-resolution pathogen identification, transmission tracking, and genomic-informed antimicrobial stewardship. Methods: We review contemporary sequencing platforms (short- and long-read), targeted and metagenomic approaches, and operational workflows that connect laboratory outputs to clinical and public health decision-making. We highlight strengths and limitations of genomic AMR prediction, the role of plasmids and mobile elements in resistance and virulence, and practical steps for clinical translation, including validation, reporting standards, and integration with electronic health records. Results: Comparative and population genomics reveal virulence determinants and host–pathogen interactions that correlate with clinical outcomes, improving risk stratification for high-risk infections. Integrating sequencing with epidemiological and clinical metadata enhances surveillance, uncovers cryptic transmission pathways, and supports infection control policies. Despite these advances, clinical implementation faces technical and interpretative barriers, as well as challenges related to turnaround time, data quality, bioinformatic complexity, cost, and ethical considerations. These issues must be addressed to realize routine, point-of-care sequencing. Conclusions: Emerging solutions, including portable sequencing devices, standardized pipelines, and machine-learning models, promise faster, more actionable results and tighter integration with electronic health records. The widespread adoption of sequencing in clinical workflows has the potential to shift infectious disease management toward precision medicine, thereby improving diagnostics, treatment selection, and public health responses.

Graphical Abstract

1. Introduction

The rapid and precise identification of infectious agents has always been at the forefront of clinical care, infection prevention, and public health response [1]. Common diagnostic methods, such as culture, microscopy, and targeted molecular assays, remain essential; however, each has its own inherent limitations [2,3]. Traditional methods, such as culture, microscopy, and targeted molecular assays, have proven indispensable but are limited by turnaround time, sensitivity, and the requirement for prior pathogen knowledge [4,5]. Technical advances in short- and long-read platforms, together with culture-independent metagenomic sequencing, have revolutionized pathogen identification and resistance tracking across clinical and public health settings [1,6,7]. The clinical and public health impact of sequencing is already significant and continues to increase.
Whole-genome sequencing (WGS) provides unambiguous strain typing and high-resolution phylogenetic data for use in outbreak detection, source attribution, and transmission mapping, both within hospitals and communities, as well as across national borders [8]. The direct identification of resistance genes and predictive markers at the genomic level holds the potential for rapid, evidence-based antimicrobial decision-making and stewardship, leading to improved patient outcomes and reduced pressure for the selection of AMR [9]. By using comparative genomics to define virulence determinants and mechanisms of immune evasion, as well as genetic correlates of clinical severity, vaccine design and therapeutic development can be informed and accelerated. Pathogen sequence data paired with epidemiological metadata and electronic health record data facilitate risk stratification. They can help guide precision approaches to infectious disease management, such as targeted prophylaxis and personalized therapy options [10,11].
However, translating sequencing into routine clinical use faces a range of technical, operational, and ethical challenges. Developing metagenomic technologies that can shorten assay time to clinically relevant decision windows remains a priority. Equally important is the ability to distinguish infection from colonization or background contamination, and to interpret the clinical significance of genomic variants. Detection of novel or poorly characterized determinants requires curated databases and standardized interpretation frameworks. Institutional bioinformatics expertise and computational infrastructure vary, and discrepancies in laboratory accreditation, regulatory oversight, and reimbursement policies limit widespread implementation [12,13]. This review summarizes sequencing technologies, clinical and public health applications, and implementation challenges, with a focus on translational links between laboratory genomics, clinical practice, and public health.

2. Sequencing Technologies and Methodological Advances

Sequencing have shifted over the past decade from an esoteric research tool to a set of pragmatic tools applicable to clinical and public health questions. Current sequencing methods for infectious diseases can be broadly organized into platform technologies, targeted versus untargeted assays, and downstream analytical workflows [14].

2.1. Sequencing Platforms and Assay Selection

Short-read platforms (Illumina) offer high per-base accuracy and cost-efficiency for routine bacterial WGS and viral surveillance. In contrast, long-read platforms (PacBio HiFi, ONT) resolve plasmids, structural variation, and full operons, crucial when genomic context determines phenotype [1,15]. Targeted assays, such as amplicon panels or hybrid capture, increase sensitivity and speed for known targets, while untargeted metagenomics (mNGS) supports hypothesis-free detection in culture-negative or atypical infections (Table 1). Hybrid short/long read strategies often provide the best balance of accuracy and structural resolution when plasmid-resolved inference or mobile-element mapping is required [16,17]. The sequential steps from clinical specimen to actionable sequencing report are illustrated in Figure 1.

2.2. Enrichment, Library Preparation, and Informatics

Optimized enrichment strategies and standardized library preparation are essential to enhance pathogen detection in low-biomass clinical samples. These enrichment strategies, ranging from selective lysis and rRNA depletion to probe-based or on-device enrichment [1,15]. Common enrichment strategies, such as selective lysis, rRNA depletion, probe-based capture, or adaptive sampling, enhance pathogen signal and improve detection sensitivity. Bioinformatic pipelines encompass preprocessing, classification or assembly, resistance-gene detection, and variant annotation, culminating in clinically interpretable reports. Version-controlled databases (e.g., CARD, AMRFinder, ResFinder) and transparent documentation of thresholds ensure reproducibility and reliability. Machine-learning tools complement rule-based approaches but require rigorous validation against phenotypic data before clinical use [18]. Targeted short-read assays remain optimal for routine surveillance, while metagenomic sequencing and hybrid strategies are reserved for cases with unresolved diagnosis or structural complexity (Figure 2).

3. Pathogen Identification and Surveillance

Genome sequencing can provide the most immediate clinical and public health benefits, including accurate pathogen detection and the contextualization of isolates within an epidemiological setting. Here, we discuss clinical diagnostics, including both targeted and metagenomic approaches; genomic epidemiology for outbreak detection and source attribution; population-level surveillance, such as wastewater genomics; and practical aspects, with a particular emphasis on MDR Klebsiella. In each subsection, we discuss the advantages of sequencing compared to the standard of care, the limitations of sequencing, and the conversion of genomic data into actionable elements to guide clinical decision-making [16].

3.1. Clinical Pathogen Identification

Targeted sequencing enables high-sensitivity detection of predefined pathogens or resistance loci, while metagenomic sequencing supports hypothesis-free identification across bacteria, viruses, fungi, and parasites [19]. In practice, most laboratories employ hybrid workflows that combine rapid targeted assays for common pathogens with metagenomic sequencing for unresolved or atypical cases, maximizing clinical yield while controlling cost and turnaround time [20]. Pragmatic challenges are poor sensitivity in the setting of abundant host nucleic acid, inability to distinguish colonization/commensal flora from invasive disease, and a higher burden of bioinformatic analysis and clinical correlation [21]. Most laboratories have developed hybrid workflows that incorporate rapid targeted assays for common, high-probability pathogens, reserving mNGS diagnostics for cases that remain unresolved or in which broad pathogen discovery is necessary, as the clinical yield increases [20,22,23] (Table 2).

3.2. Genomic Epidemiology

Whole-genome sequencing enables fine-scale resolution of transmission chains across healthcare and community settings, surpassing conventional typing methods [24]. Integration of genomic data with epidemiological metadata, such as timing, patient movement, and exposure, accelerates outbreak tracing and guides targeted infection control interventions. Standardized analysis frameworks are essential to avoid overinterpretation and ensure consistent resolution thresholds [25]. Additionally, researchers should consider pathogen-specific mutation rates, within-host diversity, and plasmid transmission, which can decouple resistance genotypes from chromosomal phylogeny.

3.3. Population-Scale Surveillance

In addition to individual and outbreak diagnosis, sequencing enables continuous surveillance of pathogen evolution, resistance emergence, and geographic spread. These genomic surveillance programs, both national and regional, for influenza, SARS-CoV-2, foodborne pathogens, and priority AMR organisms, provide an evidence base for public health policy, e.g., vaccine strain selection, infection control guidance, and stewardship priorities [26]. National and regional genomic surveillance programs for influenza, SARS-CoV-2, and priority AMR organisms inform infection-control policies and vaccine strain selection. Wastewater sequencing detected rising Omicron lineage prevalence earlier than clinical sampling in several regions, prompting targeted clinical testing and public health interventions [27]. Global coordination efforts such as the WHO GLASS Genomic Module, CDC Pathogen Genomics Program, and EMBL-EBI Pathogenwatch framework now standardize genomic data submission, ensuring comparability and rapid dissemination of AMR and outbreak information worldwide. For Klebsiella pneumoniae, genomic surveillance has revealed global spread of carbapenemase plasmids and high-risk lineages, informing infection-control strategies [28,29].

4. Antimicrobial-Resistance Prediction from Genomes

Genomic prediction of antimicrobial resistance from pathogen genomes is among the highest-value clinical applications of sequencing, as it can shorten the time to effective therapy, guide antimicrobial stewardship, and detect and track high-risk resistance determinants. However, the promise of this approach is limited by several factors [26]. The accuracy of predictions depends on the quality of the reference knowledge that links sequences to phenotypes. Genomic prediction of AMR is one of the most clinically impactful applications of sequencing, offering faster therapy guidance and resistance tracking. However, it is constrained by genotype–phenotype variability and the quality of reference databases.

4.1. Reference Resources and the Genotype-Phenotype Framework

Accurate AMR prediction depends on curated resistance databases linking specific genes, alleles, and mutations to phenotypes [30]. Several well-known, actively curated resources are widely used in clinical and research pipelines for antimicrobial resistance analysis. The Comprehensive Antibiotic Resistance Database (CARD) provides an ontology-based collection of resistance determinants and detection models [31]. The Center for Genomic Epidemiology’s ResFinder and PointFinder systems identify acquired genes and chromosomal mutations associated with resistance. NCBI’s AMRFinderPlus uses a curated reference gene catalog and Hidden Markov Models to detect resistance genes and relevant mutations [32,33]. These resources vary in scale, scope of curation, and update frequency, and many clinical pipelines cross-reference two or more databases for increased sensitivity and interpretability.

4.2. Algorithms and Analytic Approaches: Rule-Based, Comparative, and ML Methods

Rule-based inference remains standard for well-characterized mechanisms such as beta-lactamases and target-site mutations, offering transparency and ease of validation. It remains the backbone of routine surveillance tools. Machine-learning models, including DeepARG and related frameworks, augment rule-based pipelines by capturing non-obvious genomic patterns. However, their clinical utility depends on large annotated training datasets, external validation, and transparent performance metrics [34]. The workflow linking genomic inputs to resistance phenotype prediction is illustrated in Figure 3. Several bioinformatic resources have been developed to support resistance prediction from sequence data, as summarized in Table 3.

4.3. Clinical Implementation and Validation

Clinical implementation of genomic AMR prediction requires analytical and clinical validation against phenotypic gold standards, with stringent quality assurance, contamination control, and version tracking of databases and pipelines [35]. Effective reporting frameworks should clearly state the detected genes, predicted phenotypes, and confidence levels, and should be supported by provenance details, such as database versions and thresholds. Stepwise adoption, beginning with surveillance and infection control before clinical therapy guidance, is recommended.

4.4. Limitations, Common Pitfalls, and Opportunities for Improvement

Three limitations repeatedly restrict the widespread substitution of phenotypic AST with genomic prediction. First, although determining the presence or absence of a sequence is straightforward, several factors can cause discrepancies between the genotype and phenotype [36]. Second, plasmids and their capacity for horizontal transfer allow resistance genes to move independently of the chromosomal background, further complicating prediction [37]. The third source of ambiguity is metagenomic context: the absence or presence of a resistance gene in a sample may not always be informative about the gene-carrying organisms that cause disease, i.e., the resistance might be present in non-pathogenic or dead cells. Closing these gaps is an active area of research. Genotype–phenotype discordance arises from regulatory variation, gene copy number, plasmid mobility, and novel mechanisms that are not present in reference databases. Therefore, a hybrid model combining rapid genomic prediction for stewardship with phenotypic AST for confirmation offers a pragmatic path toward reliability [38]. Standardized ontologies such as the Antibiotic Resistance Ontology (ARO) and curated repositories like GAARD improve interoperability and facilitate global benchmarking [39].

5. Virulence Determinants and Pathogenesis

Characterizing the pathogenic potential associated with genomic variation is crucial for utilizing sequencing to inform clinically relevant risk stratification, targeted therapy, and prevention. Sequencing facilitates systematic discovery and surveillance of virulence factors and their genomic contexts, enabling correlations with disease severity and transmission [40,41].

5.1. Genetic Architecture of Virulence

Pathogen virulence is an emergent property of both conserved core functions and a highly variable accessory genome. The mosaic nature of its architecture implies that clinical behavior can differ dramatically between closely related strains, driven by the gain or loss of a small number of high-impact loci. Pathogen virulence reflects both conserved core functions and a variable accessory genome often carried on plasmids or genomic islands; pangenome analyses therefore help identify conserved targets for diagnostics and lineage-associated features that explain the emergence of severe disease. High-quality assemblies that capture plasmids and repeats are essential to map these loci accurately [42,43].

5.2. Mobile Elements, Plasmids, and the Coupling of Resistance with Virulence

Mobile genetic elements, including plasmids, facilitate the rapid spread of both resistance and virulence gene determinants. The co-localization of these features on the same mobile platform, particularly plasmids, carries significant clinical implications [44]. Mobile genetic elements, including conjugative plasmids, integrative elements, and phages, drive rapid co-transmission of virulence and resistance loci. The co-location of virulence and AMR genes on mobile elements accelerates the emergence of high-risk convergent strains; therefore, plasmid and mobile-element reconstruction (often via long-read or hybrid assemblies) is a priority for genomic surveillance [45,46].

5.3. From Genotype to Mechanism: Virulence Factors, Regulation, and Host Interactions

Classes of virulence determinants recur across bacterial pathogens, illustrating how genomic data informs mechanistic understanding. Genomic data identifies candidate virulence determinants, but mechanistic inference requires integration with functional data (transcriptomics, proteomics, and in vivo models) because expression and regulatory context determine clinical impact. Longitudinal and host-paired studies are particularly valuable for distinguishing colonization from invasive behavior and for validating genomic risk markers.

5.4. Clinical Correlates: Predicting Severity, Tropism, and Transmission Potential

Translating genomic virulence signatures into clinically useful predictions is an active area of translational research. Here, we review characteristics associated with specific clinical syndromes, such as an increased risk of hepatic abscess or metastatic spread, as well as traits influenced by genetic markers or combinations, capsule types, siderophore repertoires, and regulatory genes that enhance mucoviscosity [47]. The widespread utility of this approach may allow categorizing isolates as higher- or lower-risk for infection, while enhancing empirical decision-making, infection control prioritization, and public health responses through genomic risk scores that aggregate the presence/absence of multiple virulence loci. Some genomic signatures show associations with severity in retrospective cohorts, but predictive performance varies by species and clinical context. Prospective validation with host metadata is required before genomic virulence scores inform individual patient care.

5.5. Translational Potential and Current Limitations

The rational design of antivirulence therapeutics, of monoclonal antibodies, and vaccines is facilitated by the genomic mapping of conserved virulence determinants. Thus, conserved surface antigens and key virulence enzymes are rational vaccine targets; in contrast, anti-virulence drugs that interfere with siderophore systems, capsule assembly, or secretion systems might reduce pathogenicity while avoiding the selective pressure characteristic of bactericidal compounds [48]. More specifically, genomics enables reverse vaccinology by identifying and designing conserved, surface-exposed proteins shared across clinically relevant lineages. Several limitations temper these opportunities. High variability in the accessory genome limits vaccine coverage, and antigenic diversity or phase variation can lead to immune escape. If the targeted virulence factor is absent or easily lost in many clinical strains (i.e., it is accessory rather than essential), the effectiveness of antivirulence interventions will be reduced, and such strategies may have limited utility [47,49]. Thus, successful translational programs require that target prevalence be defined through genomic surveillance, mechanistic relevance be established using functional validation, and population-level impact be predicted with epidemiological modeling [50,51].

6. Precision Medicine and Clinical Integration

Genome sequencing not only enhances our understanding of pathogen biology but also has the potential to directly influence patient care. The coupling of sequencing with precision medicine refocuses the approach to infectious disease management from a broad population-based approach to an intervention strategy reserved for carefully defined strata, defined by the pathogen genotype, patient risk profile, and epidemiological context. This includes a discussion of the utility of sequencing data in personalized therapy, biomarker-driven management, and public health decision-making, as well as the hurdles that need to be overcome to implement them as part of standard clinical practice [52,53].

6.1. Individualized Therapy and Tailored Antimicrobial Selection

Guiding antimicrobial treatment is the most direct clinical application of sequencing-based precision medicine. WGS or mNGS can simultaneously identify species and detect resistance genes or mutations, thereby predicting antimicrobial susceptibility. When delivered within clinically actionable windows through validated pipelines, genomic information can guide timely de-escalation of empiric broad-spectrum therapy and the selection of narrow-spectrum agents. In aggregate, earlier genomic-informed adjustments may reduce inappropriate antimicrobial exposure and support stewardship; however, these benefits require local validation linked to clinical outcomes [54,55,56].

6.2. Biomarker-Guided Management and Public Health Decision

Predictive biomarkers, combined with sequences that are above single-pathogen calls, can also be generated during sequencing. Information on pathogen lineages may inform the likelihood of treatment failure, relapse risk, or transmission capacity. Integration of pathogen genomics with host-response biomarkers (transcriptomics/proteomics) can improve risk stratification and identify patients who may benefit from adjunctive therapies. Early clinical applications should prioritize robust, prospectively validated biomarkers with clear clinical actionability [57,58,59]. At the population level, precision medicine in infectious diseases intersects with public health. The ability of sequencing-informed risk stratification to inform isolation precautions, cohorting decisions, or prioritization of prophylaxis in exposed contacts can directly support infection-control and public health decision-making [60]. Genomic-informed risk stratification thereby supports targeted public health actions while minimizing unnecessary restrictions, but this requires integrated clinical–public health data flows and governance. Effective translation therefore depends on multidisciplinary networks that link genomic laboratories, informatics teams, clinicians, and public health authorities (Figure 4).

7. Challenges, Limitations, and Ethical Considerations

A combination of technical, operational, interpretive, regulatory, and ethical barriers limits the translation of genomic data from the research bench and sentinel surveillance into clinical standard of care. These limitations collectively determine whether sequencing yields actionable clinical insights or introduces uncertainty.

7.1. Technical and Interpretive Limitations

Low pathogen load and excess host nucleic acid in blood, CSF, and respiratory samples often reduce the number of informative reads, limiting reliable pathogen detection and genotyping. While library preparation and enrichment strategies address this issue, they only partially mitigate it; enrichment can introduce biases, and targeted assays may lead to false negatives as primer or probe sites evolve [61]. Nucleic acid detection alone cannot differentiate between live and dead organisms or determine whether a detected organism is the causative agent, a colonizer, or an environmental contaminant. Therefore, it requires careful integration with clinical, radiological, and laboratory data. Interpretive challenges compound technical limitations. Many organism-drug pairs lack genotype-to-phenotype relationships, and existing reference databases often lag emerging mechanisms and differ in terminology and scope. Insights into AMR mechanisms and epidemiology. Plasmid mobility and horizontal gene transfer decouple resistance determinants from chromosomal phylogeny, complicating inference about transmission and about which organism within a polymicrobial specimen carries a given resistance gene [26]. Within-host diversity and heteroresistance compound the confusion: minority resistant subpopulations relevant to the clinical phenotype of an infection may remain undetectable by standard sequencing techniques due to their low frequency.

7.2. Operational, Regulatory, and Economic Barriers

Sequencing in acute infections must deliver results within 24–48 h, but rapid turnaround requires dedicated laboratory capacity and automation. Although rapid workflows are possible, they often require dedicated laboratory capacity, automated library preparation, and on-site computational resources. Outsourcing to reference laboratories reduces the capital burden but usually introduces unacceptable delays for rapid clinical decision-making. Regulatory frameworks and laboratory accreditation represent additional hurdles. Performance attributes of sequencing tools, including validation metrics for sequencing assays, analytical sensitivity, specificity, reproducibility, and clinical concordance with phenotypic standards, must be established and consistently maintained under a quality management system. Sequencing is currently treated by many regulatory authorities as a diagnostic procedure, with the stringency of controls applied to other established clinical assays. However, guidance continues to change and varies from jurisdiction to jurisdiction. The policies on reimbursement are lagging behind the capability; without predictable funding models, institutions may be unable to internalize the costs of equipment, consumables, informatics infrastructure, and specialist personnel required to sustain service delivery [62,63]. Sustained adoption will depend on coordinated investment in regional sequencing hubs, clear regulatory guidance, and predictable reimbursement models.

7.3. Ethical, Legal, and Social Implications

Pathogen genomic data are often accompanied by sensitive metadata, including collection dates, locations, and clinical context, which can inadvertently lead to the re-identification of individuals or institutions if appropriate governance frameworks are not in place. To safeguard privacy while maintaining data utility, institutions should implement a layered governance structure that includes minimal metadata sharing for public repositories, controlled access to sensitive linkages, such as patient identifiers, through data access committees, and the adoption of federated or privacy-preserving analytical approaches that enable model training on local data without bulk data transfers. Consent models must be pragmatic and context-specific: while routine diagnostic sequencing should be governed institutionally to allow for essential public health surveillance, secondary research applications should follow opt-in or broad consent principles supported by clear benefit-sharing mechanisms [64,65]. Equitable implementation in low- and middle-income countries requires investment in local capacity, fair data-sharing agreements, and recognition of data ownership [66,67,68]. Practical ways to enable privacy-respecting collaborative surveillance include federated data models, harmonized consent models, and privacy-preserving analytics. Embedding sequencing within multidisciplinary teams linking clinicians, microbiologists, bioinformaticians, and ethicists ensures responsible interpretation and patient-centered use.

8. Future Perspectives

Genome sequencing of infectious diseases has reached a level of maturity in which many of the obstacles to its larger-scale adoption are less technical than they are organizational, interpretive, and social. Advances in portable platforms, automation, and AI-assisted analytics will enable faster, bedside sequencing with real-time interpretation [1,69]. Machine-learning tools will complement rule-based frameworks if training data and validation metrics are transparent. Interoperable standards and privacy-preserving data systems are essential for secure, cross-jurisdictional sharing [1,70]. Global equity requires affordable platforms, regional hubs, and public–private partnerships to sustain reagent supply and workforce training. Also, data systems architecture and governance will be critical. Responsible genomic practice demands external quality assurance, standardized reference materials, and transparent governance so that sequencing becomes a routine, safe, and equitable component of infectious-disease care.

9. Conclusions

Genome sequencing is poised to transition from a specialized research tool to a cornerstone of clinical and public health practice, but achieving this shift demands coordinated, multi-level action. Sequencing provides information that was inconceivable even ten years ago, from identifying previously unrecognized pathogens to defining transmission networks with unprecedented resolution. Together with recent advances in bioinformatics and systems biology, it provides a potent framework for a better understanding and management of AMR, virulence, and host–pathogen interactions. Standardization and validation of sequencing workflows are essential, requiring the adoption of accredited, version-controlled pipelines, rigorous external quality assessment, and transparent documentation of analytical provenance. A staged approach to implementation should be pursued, starting with applications in pathogen surveillance and infection control, and then extending to therapeutic decision support once local validation and phenotypic confirmation are established. Interoperability must also be prioritized through the development of standardized APIs and report formats that allow seamless integration of concise genomic insights into electronic health records and antimicrobial stewardship systems. Sustainable adoption will further depend on investment in workforce development, regional sequencing hubs, and reliable reagent supply chains to ensure equitable access, particularly across low- and middle-income regions. Collectively, transparent governance, validated machine-learning frameworks, and federated data-sharing models will ensure that genome sequencing becomes a safe, equitable, and clinically meaningful component of infectious-disease medicine.

Author Contributions

Conceptualization, G.M.H. and M.I.H.; methodology, G.M.H. and T.M.; software, T.M. and S.S.S.; validation, A.S., S.S.S. and M.I.H.; formal analysis, A.S. and S.S.S.; investigation, T.M.; resources, G.M.H. data curation, T.M.; writing—original draft preparation, G.M.H. and T.M.; writing—review and editing, S.S.S. and M.I.H.; visualization, S.S.S.; supervision, S.S.S., and M.I.H.; project administration, M.I.H.; funding acquisition, G.M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work is sponsored by Prince Sattam Bin Abdulaziz University (Grant No. PSAU/2025/RV/7).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study.

Acknowledgments

G.M.H. thanks the Prince Sattam Bin Abdulaziz University (Grant No. PSAU/2025/RV/7). A.S. acknowledges Ajman University for APC support.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

AI—Artificial Intelligence; AMR—Antimicrobial Resistance; ANI—Average Nucleotide Identity; CDC—Centers for Disease Control and Prevention; cgMLST—Core Genome Multi-Locus Sequence Typing; CNV—Copy Number Variation; CSF—Cerebrospinal Fluid; ddPCR—Droplet Digital PCR; DNA—Deoxyribonucleic Acid; DNA-seq—DNA Sequencing; EHR—Electronic Health Records; HGT—Horizontal Gene Transfer; HIFI—High-Fidelity (long reads); HIV—Human Immunodeficiency Virus; ICU—Intensive Care Unit; INDEL—Insertion/Deletion; LIMS—Laboratory Information Management System; LOD—Limit of Detection; MDR—Multidrug-Resistant; ML—Machine Learning; MLST—Multi-Locus Sequence Typing; mNGS—Metagenomic Next-Generation Sequencing; NC—Negative Control; NGS—Next-Generation Sequencing; ONT—Oxford Nanopore Technologies; ORF—Open Reading Frame; PacBio—Pacific Biosciences; PDR—Pan-Drug-Resistant; PCR—Polymerase Chain Reaction; qPCR—Quantitative Polymerase Chain Reaction; QA—Quality Assurance; QC—Quality Control; RNA—Ribonucleic Acid; RNA-seq—RNA Sequencing; rRNA—Ribosomal RNA; RT-PCR—Reverse Transcription Polymerase Chain Reaction; SARS-CoV-2—Severe Acute Respiratory Syndrome Coronavirus 2; SNP—Single-Nucleotide Polymorphism; SNV—Single-Nucleotide Variant; ST—Sequence Type; WGS—Whole-Genome Sequencing; WHO—World Health Organization; XDR—Extensively Drug-Resistant.

References

  1. Elbehiry, A.; Abalkhail, A. Metagenomic Next-Generation Sequencing in Infectious Diseases: Clinical Applications, Translational Challenges, and Future Directions. Diagnostics 2025, 15, 1991. [Google Scholar] [CrossRef]
  2. Liu, Q.; Jin, X.; Cheng, J.; Zhou, H.; Zhang, Y.; Dai, Y. Advances in the application of molecular diagnostic techniques for the detection of infectious disease pathogens. Mol. Med. Rep. 2023, 27, 104. [Google Scholar] [CrossRef]
  3. França, R.F.; da Silva, C.C.; De Paula, S.O. Recent advances in molecular medicine techniques for the diagnosis, prevention, and control of infectious diseases. Eur. J. Clin. Microbiol. Infect. Dis. 2013, 32, 723–728. [Google Scholar] [CrossRef]
  4. Oehler, J.B.; Burns, K.; Warner, J.; Schmitz, U. Long-Read Sequencing for the Rapid Response to Infectious Diseases Outbreaks. Curr. Clin. Microbiol. Rep. 2025, 12, 10. [Google Scholar] [CrossRef] [PubMed]
  5. Wu, H.-C.; Chiu, Y.-T.; Wu, I.C.; Liou, C.-H.; Cheng, H.-W.; Kuo, S.-C.; Lauderdale, T.-L.; Sytwu, H.-K.; Liao, Y.-C.; Chen, F.-J. Streamlining whole genome sequencing for clinical diagnostics with ONT technology. Sci. Rep. 2025, 15, 6270. [Google Scholar] [CrossRef] [PubMed]
  6. Yan, M.; Shang, L.; Wang, Y.; Wang, C.; Cao, B. Metagenomic next-generation sequencing on treatment strategies and prognosis of patients with lower respiratory tract infections: A systematic review and meta-analysis. Int. J. Antimicrob. Agents 2025, 65, 107440. [Google Scholar] [CrossRef] [PubMed]
  7. Long, J.; Zeng, B.; Li, J.; Zhang, J.; Deng, G. Global trends in the application of nanopore sequencing technology in the detection of infectious disease pathogens: A bibliometric analysis from 2014 to 2024. Front. Med. 2025, 12, 1610063. [Google Scholar] [CrossRef]
  8. Payne, M.; Hu, D.; Wang, Q.; Sullivan, G.; Graham, R.M.; Rathnayake, I.U.; Jennison, A.V.; Sintchenko, V.; Lan, R. DODGE: Automated point source bacterial outbreak detection using cumulative long term genomic surveillance. Bioinformatics 2024, 40, btae427. [Google Scholar] [CrossRef]
  9. PFMG2025 contributors. PFMG2025-integrating genomic medicine into the national healthcare system in France. Lancet Reg. Health Eur. 2025, 50, 101183. [Google Scholar] [CrossRef]
  10. Yadav, D.; Patil-Takbhate, B.; Khandagale, A.; Bhawalkar, J.; Tripathy, S.; Khopkar-Kale, P. Next-Generation sequencing transforming clinical practice and precision medicine. Clin. Chim. Acta 2023, 551, 117568. [Google Scholar] [CrossRef]
  11. Lam, W.K.J.; Lau, C.S.; Luk, H.M.; Au, L.W.C.; Chan, G.C.P.; Chan, W.Y.H.; Cheng, S.S.W.; Cheng, T.H.T.; Cheung, L.L.; Cheung, Y.F.; et al. The implementation of genome sequencing in rare genetic diseases diagnosis: A pilot study from the Hong Kong genome project. Lancet Reg. Health–West. Pac. 2025, 55, 101473. [Google Scholar] [CrossRef]
  12. Ricaldi, J.N.; Parker, J.T.; Barnes, N.; Turner, H.; Santibañez, S. An Advanced Molecular Detection Roadmap for Nonlaboratorians. Emerg. Infect. Dis. 2025, 31, 3–8. [Google Scholar] [CrossRef]
  13. Moore, J.; Sanon, R.; Khudyakov, Y.; Barnes, N. Strategies and Opportunities to Improve Community Health through Advanced Molecular Detection and Genomic Surveillance of Infectious Diseases. Emerg. Infect. Dis. 2025, 31, 9–13. [Google Scholar] [CrossRef]
  14. Chen, J.; Qin, Z.; Jia, Z. The application status of sequencing technology in global respiratory infectious disease diagnosis. Infection 2024, 52, 2169–2181. [Google Scholar] [CrossRef] [PubMed]
  15. Maljkovic Berry, I.; Melendrez, M.C.; Bishop-Lilly, K.A.; Rutvisuttinunt, W.; Pollett, S.; Talundzic, E.; Morton, L.; Jarman, R.G. Next generation sequencing and bioinformatics methodologies for infectious disease research and public health: Approaches, applications, and considerations for development of laboratory capacity. J. Infect. Dis. 2020, 221, S292–S307. [Google Scholar] [CrossRef] [PubMed]
  16. Vashisht, V.; Vashisht, A.; Mondal, A.K.; Farmaha, J.; Alptekin, A.; Singh, H.; Ahluwalia, P.; Srinivas, A.; Kolhe, R. Genomics for emerging pathogen identification and monitoring: Prospects and obstacles. BioMedInformatics 2023, 3, 1145–1177. [Google Scholar] [CrossRef]
  17. Hilt, E.E.; Ferrieri, P. Next generation and other sequencing technologies in diagnostic microbiology and infectious diseases. Genes 2022, 13, 1566. [Google Scholar] [CrossRef]
  18. Miglietta, L.; Rawson, T.M.; Galiwango, R.; Tasker, A.; Ming, D.K.; Akogo, D.; Ferreyra, C.; Aboagye, E.O.; Gordon, N.C.; Garcia-Vidal, C.; et al. Artificial intelligence and infectious disease diagnostics: State of the art and future perspectives. Lancet Infect. Dis. 2025. [Google Scholar] [CrossRef]
  19. Pei, X.M.; Yeung, M.H.Y.; Wong, A.N.N.; Tsang, H.F.; Yu, A.C.S.; Yim, A.K.Y.; Wong, S.C.C. Targeted Sequencing Approach and Its Clinical Applications for the Molecular Diagnosis of Human Diseases. Cells 2023, 12, 493. [Google Scholar] [CrossRef]
  20. Marra, P.S.; Marra, A.R.; Chen, E.; Kobayashi, T.; Celeghini, P.D.; Gutfreund, M.C.; Pardo, I.; Lopes, G.O.V.; Hsieh, M.K.; Boodhoo, N.A.; et al. Metagenomic Next-generation Sequencing in Patients With Infectious Meningoencephalitis: A Comprehensive Systematic Literature Review and Meta-analysis. Open Forum Infect. Dis. 2025, 12, ofaf274. [Google Scholar] [CrossRef]
  21. Schrijver, I.; Aziz, N.; Farkas, D.H.; Furtado, M.; Gonzalez, A.F.; Greiner, T.C.; Grody, W.W.; Hambuch, T.; Kalman, L.; Kant, J.A.; et al. Opportunities and challenges associated with clinical diagnostic genome sequencing: A report of the Association for Molecular Pathology. J. Mol. Diagn. 2012, 14, 525–540. [Google Scholar] [CrossRef] [PubMed]
  22. Gaudin, M.; Desnues, C. Hybrid Capture-Based Next Generation Sequencing and Its Application to Human Infectious Diseases. Front. Microbiol. 2018, 9, 2924. [Google Scholar] [CrossRef] [PubMed]
  23. Wu, X.; Sun, T.; He, H.; Xing, L.; Cheng, Z.; Geng, S.; Xu, D.; Luo, H.; Chen, C.; Jiang, M.; et al. Effect of Metagenomic Next-Generation Sequencing on Clinical Outcomes of Patients with Severe Community-Acquired Pneumonia in the ICU: A Multicenter, Randomized Controlled Trial. Chest 2025, 167, 362–373. [Google Scholar] [CrossRef] [PubMed]
  24. Ghodousi, A.; Cannas, A.; Tagliani, E.; Batignani, V.; Bisognin, F.; Borroni, E.; Butera, O.; Codecasa, L.R.; Monte, P.D.; De Maio, F.; et al. Comprehensive Whole Genome Sequencing Dataset of Mycobacterium tuberculosis Strains Collected Across Italy. Sci. Data 2025, 12, 624. [Google Scholar] [CrossRef]
  25. Medugu, N.; Aworh, M.K.; Iregbu, K.; Nwajiobi-Princewill, P.; Hull, D.M.; Harden, L.; Singh, P.; Obaro, S.; Egwuenu, A.; Adeboye, F.; et al. Whole genome sequencing reveals virulence-mobile element linkages and phylogenetic diversity in multidrug-resistant Escherichia coli from Nigeria. Front. Microbiol. 2025, 16, 1579175. [Google Scholar] [CrossRef]
  26. Sherry, N.L.; Lee, J.Y.H.; Giulieri, S.G.; Connor, C.H.; Horan, K.; Lacey, J.A.; Lane, C.R.; Carter, G.P.; Seemann, T.; Egli, A.; et al. Genomics for antimicrobial resistance-progress and future directions. Antimicrob. Agents Chemother. 2025, 69, e0108224. [Google Scholar] [CrossRef]
  27. Wheeler, N.E.; Price, V.; Cunningham-Oakes, E.; Tsang, K.K.; Nunn, J.G.; Midega, J.T.; Anjum, M.F.; Wade, M.J.; Feasey, N.A.; Peacock, S.J.; et al. Innovations in genomic antimicrobial resistance surveillance. Lancet Microbe 2023, 4, e1063–e1070. [Google Scholar] [CrossRef]
  28. Zhou, X.; Wagh, K.; Lv, G.; Sharma, D.; Lei, W. Genome drafting of nosocomial infection CRE Klebsiella pneumoniae confirming resistance to colistin and eravacycline, carrying bla (NDM-1), mcr-1, and bla (KPC-2), in neonatology from November to December 2023. Front. Cell. Infect. Microbiol. 2024, 14, 1528017. [Google Scholar] [CrossRef]
  29. García-González, N.; Beamud, B.; Sevilla-Fortuny, J.; Sánchez-Hellín, V.; Vidal, I.; Rodríguez, J.C.; Fuster, B.; Tormo, N.; Salvador, C.; Gimeno, C.; et al. Genomic surveillance reveals different transmission patterns between third-generation cephalosporin and carbapenem resistance in Klebsiella pneumoniae in the Comunidad Valenciana (Spain), 2018–2020. Antimicrob. Resist. Infect. Control 2025, 14, 44. [Google Scholar] [CrossRef]
  30. Cholsaktrakool, P.; Kawang, K.; Sangpiromapichai, N.; Thongsuk, P.; Anuntakarun, S.; Kunadirek, P.; Chuaypen, N.; Nilgate, S.; Kueakulpattana, N.; Rirerm, U.; et al. Inference of antimicrobial resistance (AMR) from a whole genome database outperforming AMR gene detection. iScience 2025, 28, 112962. [Google Scholar] [CrossRef]
  31. Alcock, B.P.; Huynh, W.; Chalil, R.; Smith, K.W.; Raphenya, A.R.; Wlodarski, M.A.; Edalatmand, A.; Petkau, A.; Syed, S.A.; Tsang, K.K.; et al. CARD 2023: Expanded curation, support for machine learning, and resistome prediction at the Comprehensive Antibiotic Resistance Database. Nucleic Acids Res. 2022, 51, D690–D699. [Google Scholar] [CrossRef]
  32. Florensa, A.F.; Kaas, R.S.; Clausen, P.; Aytan-Aktug, D.; Aarestrup, F.M. ResFinder—An open online resource for identification of antimicrobial resistance genes in next-generation sequencing data and prediction of phenotypes from genotypes. Microb. Genom. 2022, 8, 000748. [Google Scholar] [CrossRef]
  33. Feldgarden, M.; Brover, V.; Gonzalez-Escalona, N.; Frye, J.G.; Haendiges, J.; Haft, D.H.; Hoffmann, M.; Pettengill, J.B.; Prasad, A.B.; Tillman, G.E.; et al. AMRFinderPlus and the Reference Gene Catalog facilitate examination of the genomic links among antimicrobial resistance, stress response, and virulence. Sci. Rep. 2021, 11, 12728. [Google Scholar] [CrossRef]
  34. Kierner, S.; Kierner, P.; Kucharski, J. Combining machine learning models and rule engines in clinical decision systems: Exploring optimal aggregation methods for vaccine hesitancy prediction. Comput. Biol. Med. 2025, 188, 109749. [Google Scholar] [CrossRef]
  35. Matsumura, Y.; Yamamoto, M.; Gomi, R.; Tsuchido, Y.; Shinohara, K.; Noguchi, T.; Nagao, M. Integrating whole-genome sequencing into antimicrobial resistance surveillance: Methodologies, challenges, and perspectives. Clin. Microbiol. Rev. 2025, e0014022. [Google Scholar] [CrossRef]
  36. Hassall, J.; Coxon, C.; Patel, V.C.; Goldenberg, S.D.; Sergaki, C. Limitations of current techniques in clinical antimicrobial resistance diagnosis: Examples and future prospects. NPJ Antimicrob. Resist. 2024, 2, 16. [Google Scholar] [CrossRef]
  37. Eladawy, M.; Heslop, N.; Negus, D.; Thomas, J.C.; Hoyles, L. Phenotype–genotype discordance in antimicrobial resistance profiles of Gram-negative uropathogens recovered from catheter-associated urinary tract infections in Egypt. bioRxiv 2025. bioRxiv: 2025.2004.2017.649370. [Google Scholar] [CrossRef] [PubMed]
  38. Ugarcina Perovic, S.; Ramji, V.; Chong, H.; Duan, Y.; Maguire, F.; Coelho, L.P. argNorm: Normalization of antibiotic resistance gene annotations to the Antibiotic Resistance Ontology (ARO). Bioinformatics 2025, 41, btaf173. [Google Scholar] [CrossRef] [PubMed]
  39. Mendelsohn, E.; Ross, N.; White, A.M.; Whiting, K.; Basaraba, C.; Madubuonwu, B.W.; Johnson, E.; Dualeh, M.; Matson, Z.; Dattaray, S. A global repository of novel antimicrobial emergence events. F1000Research 2021, 9, 1320. [Google Scholar] [CrossRef] [PubMed]
  40. Altamirano, S.; Jackson, K.M.; Nielsen, K. The interplay of phenotype and genotype in Cryptococcus neoformans disease. Biosci. Rep. 2020, 40, BSR20190337. [Google Scholar] [CrossRef]
  41. Yu, H.; Li, L.; Huffman, A.; Beverley, J.; Hur, J.; Merrell, E.; Huang, H.H.; Wang, Y.; Liu, Y.; Ong, E.; et al. A new framework for host-pathogen interaction research. Front. Immunol. 2022, 13, 1066733. [Google Scholar] [CrossRef]
  42. Espinosa, E.; Bautista, R.; Larrosa, R.; Plata, O. Advancements in long-read genome sequencing technologies and algorithms. Genomics 2024, 116, 110842. [Google Scholar] [CrossRef] [PubMed]
  43. Maciel-Guerra, A.; Babaarslan, K.; Baker, M.; Rahman, A.; Hossain, M.; Sadique, A.; Alam, J.; Uzzaman, S.; Ferdous Rahman Sarker, M.; Sultana, N.; et al. Core and accessory genomic traits of Vibrio cholerae O1 drive lineage transmission and disease severity. Nat. Commun. 2024, 15, 8231. [Google Scholar] [CrossRef] [PubMed]
  44. Kumavath, R.; Gupta, P.; Tatta, E.R.; Mohan, M.S.; Salim, S.A.; Busi, S. Unraveling the role of mobile genetic elements in antibiotic resistance transmission and defense strategies in bacteria. Front. Syst. Biol. 2025, 5, 1557413. [Google Scholar] [CrossRef] [PubMed]
  45. Naureen, Z.; Dautaj, A.; Anpilogov, K.; Camilleri, G.; Dhuli, K.; Tanzi, B.; Maltese, P.E.; Cristofoli, F.; De Antoni, L.; Beccari, T.; et al. Bacteriophages presence in nature and their role in the natural selection of bacterial populations. Acta Biomed. 2020, 91, e2020024. [Google Scholar] [CrossRef]
  46. Sun, Z.; Zhang, J.; Wang, C.; Chen, J.; Li, P.; Su, J.; Xu, X.; Wang, M. The pivotal role of IncFIB (Mar) plasmid in the emergence and spread of hypervirulent carbapenem-resistant Klebsiella pneumoniae. Sci. Adv. 2025, 11, eado9097. [Google Scholar] [CrossRef]
  47. Zhou, S.; Liu, B.; Zheng, D.; Chen, L.; Yang, J. VFDB 2025: An integrated resource for exploring anti-virulence compounds. Nucleic Acids Res. 2025, 53, D871–D877. [Google Scholar] [CrossRef]
  48. Lau, W.Y.V.; Taylor, P.K.; Brinkman, F.S.L.; Lee, A.H.Y. Pathogen-associated gene discovery workflows for novel antivirulence therapeutic development. eBioMedicine 2023, 88, 104429. [Google Scholar] [CrossRef]
  49. Sette, A.; Rappuoli, R. Reverse vaccinology: Developing vaccines in the era of genomics. Immunity 2010, 33, 530–541. [Google Scholar] [CrossRef]
  50. Shendure, J.; Findlay, G.M.; Snyder, M.W. Genomic Medicine-Progress, Pitfalls, and Promise. Cell 2019, 177, 45–57. [Google Scholar] [CrossRef]
  51. Shao, J.; Ma, J.; Yu, Y.; Zhang, S.; Wang, W.; Li, W.; Wang, C. A multimodal integration pipeline for accurate diagnosis, pathogen identification, and prognosis prediction of pulmonary infections. Innovation 2024, 5, 100648. [Google Scholar] [CrossRef] [PubMed]
  52. Khan, A.; Barapatre, A.R.; Babar, N.; Doshi, J.; Ghaly, M.; Patel, K.G.; Nawaz, S.; Hasana, U.; Khatri, S.P.; Pathange, S.; et al. Genomic medicine and personalized treatment: A narrative review. Ann. Med. Surg. 2025, 87, 1406–1414. [Google Scholar] [CrossRef] [PubMed]
  53. Jamalinia, M.; Weiskirchen, R. Advances in personalized medicine: Translating genomic insights into targeted therapies for cancer treatment. Ann. Transl. Med. 2025, 13, 18. [Google Scholar] [CrossRef] [PubMed]
  54. Echavarría, M.; Marcone, D.N.; Querci, M.; Seoane, A.; Ypas, M.; Videla, C.; O’Farrell, C.; Vidaurreta, S.; Ekstrom, J.; Carballal, G. Clinical impact of rapid molecular detection of respiratory pathogens in patients with acute respiratory infection. J. Clin. Virol. 2018, 108, 90–95. [Google Scholar] [CrossRef]
  55. Nazari, M.; Hemmati, J.; Asghari, B. Comprehensive Analysis of Virulence Genes, Antibiotic Resistance, Biofilm Formation, and Sequence Types in Clinical Isolates of Klebsiella pneumoniae. Can. J. Infect. Dis. Med. Microbiol. 2024, 2024, 1403019. [Google Scholar] [CrossRef]
  56. Cogliati Dezza, F.; Olivares-Navarro, P.; Caponcello, M.G.; Salazar-Moscoso, M.; Cosano-Perez, P.; Retamar Gentil, P.; Alonso-González, I.; Oteo-Iglesias, J.; Torre-Cisneros, J.; Horcajada, J.P. An International Delphi Survey on Priorities in Antimicrobial Resistant Infections Therapeutic Research: A Preliminary Study of MePRAM Project. SSRN 5320670. Available online: https://dx.doi.org/10.2139/ssrn.5320670 (accessed on 10 August 2025).
  57. Aalam, J.; Ahmad Shah, S.N.; Parveen, R. An extensive review on infectious disease diagnosis using machine learning techniques and next generation sequencing: State-of-the-art and perspectives. Comput. Biol. Med. 2025, 189, 109962. [Google Scholar] [CrossRef]
  58. Wanichthanarak, K.; Fahrmann, J.F.; Grapov, D. Genomic, Proteomic, and Metabolomic Data Integration Strategies. Biomark. Insights 2015, 10, 1–6. [Google Scholar] [CrossRef]
  59. DeGroat, W.; Abdelhalim, H.; Peker, E.; Sheth, N.; Narayanan, R.; Zeeshan, S.; Liang, B.T.; Ahmed, Z. Multimodal AI/ML for discovering novel biomarkers and predicting disease using multi-omics profiles of patients with cardiovascular diseases. Sci. Rep. 2024, 14, 26503. [Google Scholar] [CrossRef]
  60. Roberts, M.C.; Holt, K.E.; Del Fiol, G.; Baccarelli, A.A.; Allen, C.G. Precision public health in the era of genomics and big data. Nat. Med. 2024, 30, 1865–1873. [Google Scholar] [CrossRef]
  61. Karakoyun, H.K.; Sayar, C.; Yararbaş, K. Challenges in clinical interpretation of next-generation sequencing data: Advantages and Pitfalls. Results Eng. 2023, 20, 101421. [Google Scholar] [CrossRef]
  62. Marklewitz, M.; Jaguparov, A.; Wilhelm, A.; Akande, O.W.; Musul, B.; Poates, A.L.; Afrough, B.; Norberg, A.; Hull, N.C.; Ehsani, S.; et al. Genomics costing tool: Considerations for improving cost-efficiencies through cross scenario comparison. Front. Public Health 2025, 12, 1498094. [Google Scholar] [CrossRef]
  63. Phillips, K.A.; Douglas, M.P.; Wordsworth, S.; Buchanan, J.; Marshall, D.A. Availability and funding of clinical genomic sequencing globally. BMJ Global Health 2021, 6, e004415. [Google Scholar] [CrossRef]
  64. Heeney, C.; Hawkins, N.; de Vries, J.; Boddington, P.; Kaye, J. Assessing the privacy risks of data sharing in genomics. Public Health Genom. 2011, 14, 17–25. [Google Scholar] [CrossRef]
  65. Sundermann, A.J.; Rosa, R.; Harris, P.N.A.; Snitkin, E.; Javaid, W.; Moore, N.M.; Hayden, M.K.; Allen, K.; Rodino, K.; Peacock, S.J.; et al. Pathogen genomics in healthcare: Overcoming barriers to proactive surveillance. Antimicrob. Agents Chemother. 2025, 69, e01479-24. [Google Scholar] [CrossRef] [PubMed]
  66. Talha, M.; Shamim, L.; Oduoye, M.O. Crowdsourced genomic surveillance for emerging pathogens: A 2025 public health strategy. J. Public Health Res. 2025, 14, 22799036251356354. [Google Scholar] [CrossRef] [PubMed]
  67. Holden, N.J. Data sharing considerations to maximize the use of pathogen biological and genomics resources data for public health. J. Appl. Microbiol. 2024, 135, lxae204. [Google Scholar] [CrossRef] [PubMed]
  68. Munung, N.S. Science and Society: Pathways to Equitable Access and Delivery of Genomics Medicine in Africa. Curr. Genet. Med. Rep. 2025, 13, 1. [Google Scholar] [CrossRef]
  69. Test, W.D.; Genie, L.; Test, S.D.; Genie, H.; Genie, F.; Genie, S.; Plus, H.G.; Test, H.D.; Test, D.D.; Test, H.D. Breakthroughs in Genomic Medicine: What’s New in 2025? Pharmacogenomics 2025. [Google Scholar]
  70. Mani, S.; Lalani, S.R.; Pammi, M. Genomics and multiomics in the age of precision medicine. Pediatr. Res. 2025, 97, 1399–1410. [Google Scholar] [CrossRef]
Figure 1. Workflow of genome sequencing in infectious disease diagnostics. The process begins with specimen extraction, followed by pathogen enrichment and library preparation and culminates in sequencing and quality control (QC) assessment. The resulting data are then analyzed through taxonomic classification and antimicrobial resistance (AMR) detection, culminating in clinical interpretation and integration of the final report into the electronic health record (EHR) system.
Figure 1. Workflow of genome sequencing in infectious disease diagnostics. The process begins with specimen extraction, followed by pathogen enrichment and library preparation and culminates in sequencing and quality control (QC) assessment. The resulting data are then analyzed through taxonomic classification and antimicrobial resistance (AMR) detection, culminating in clinical interpretation and integration of the final report into the electronic health record (EHR) system.
Pharmaceuticals 18 01687 g001
Figure 2. Comparative roles of sequencing approaches. A two-axis framework highlights differences between targeted and untargeted assays on one axis and between short- and long-read technologies on the other. Targeted short-read assays deliver high sensitivity for predefined loci, while untargeted short-read metagenomics enables hypothesis-free detection. Targeted long-read strategies resolve plasmids and complex loci, whereas untargeted long-read sequencing supports real-time outbreak response and structural characterization.
Figure 2. Comparative roles of sequencing approaches. A two-axis framework highlights differences between targeted and untargeted assays on one axis and between short- and long-read technologies on the other. Targeted short-read assays deliver high sensitivity for predefined loci, while untargeted short-read metagenomics enables hypothesis-free detection. Targeted long-read strategies resolve plasmids and complex loci, whereas untargeted long-read sequencing supports real-time outbreak response and structural characterization.
Pharmaceuticals 18 01687 g002
Figure 3. Genomic prediction of AMR. Genome or metagenome sequences enter rule-based pipelines that detect known genes and mutations, supplemented by machine-learning models that capture complex genomic patterns. Outputs predict resistance to antibiotic classes such as carbapenems, fluoroquinolones, or aminoglycosides. Feedback from phenotypic testing informs continuous refinement of the database and improvement of the model.
Figure 3. Genomic prediction of AMR. Genome or metagenome sequences enter rule-based pipelines that detect known genes and mutations, supplemented by machine-learning models that capture complex genomic patterns. Outputs predict resistance to antibiotic classes such as carbapenems, fluoroquinolones, or aminoglycosides. Feedback from phenotypic testing informs continuous refinement of the database and improvement of the model.
Pharmaceuticals 18 01687 g003
Figure 4. Applications of genome sequencing in infectious diseases. A hub-and-spoke schematic highlights diverse roles of sequencing, including pathogen identification, AMR prediction, outbreak investigation, population surveillance, virulence mapping, precision medicine, and public health decision support. Together, these applications demonstrate how sequencing bridges laboratory genomics with patient care and population health. “Novel pathogens” refers to newly emerging or previously uncharacterized strains associated with outbreaks or unexplained clinical syndromes; this includes both new species and novel genomic variants of known pathogens.
Figure 4. Applications of genome sequencing in infectious diseases. A hub-and-spoke schematic highlights diverse roles of sequencing, including pathogen identification, AMR prediction, outbreak investigation, population surveillance, virulence mapping, precision medicine, and public health decision support. Together, these applications demonstrate how sequencing bridges laboratory genomics with patient care and population health. “Novel pathogens” refers to newly emerging or previously uncharacterized strains associated with outbreaks or unexplained clinical syndromes; this includes both new species and novel genomic variants of known pathogens.
Pharmaceuticals 18 01687 g004
Table 1. Comparative summary of sequencing platforms for infectious disease applications *.
Table 1. Comparative summary of sequencing platforms for infectious disease applications *.
TechnologyPlatform
Examples
Typical Read LengthAccuracy (Per Base)Throughput (Per Run)StrengthsLimitationsRepresentative Applications
Short-read sequencingIllumina MiSeq, NextSeq, NovaSeq50–300 bp (paired-end)>99.9%Up to hundreds of Gb, depending on systemHigh accuracy, cost-effective, standardized pipelinesFragmented assemblies in repetitive regions; poor plasmid/structural resolutionRoutine bacterial WGS, viral surveillance, variant calling
Long-read sequencing (Nanopore)ONT MinION, GridION, PromethION1 kb to several Mb~90–99% (raw)Tens to hundreds of GbReal-time sequencing, portable, long contigsLower per-base accuracy; error-prone homopolymersRapid pathogen ID, outbreak response, metagenomics
Long-read sequencing (PacBio HiFi)Sequel IIe, Revio15–20 kb (up to 50 kb)>99.9%~60–120 GbHighly accurate long reads, excellent assembly qualityHigher cost per run; longer prep workflowsComplete assemblies, plasmid and resistance island mapping
Targeted sequencingIon Torrent, AmpliSeq panels200–600 bp~98–99%Few GbFast turnaround, focused panels, high sensitivity for known lociRequires prior knowledge; limited to predefined targetsAMR gene panels, viral genotyping, clinical diagnostics
* Short-read Illumina systems provide high accuracy at modest read lengths, while Nanopore and PacBio deliver long reads that enable structural resolution and plasmid assembly. Targeted approaches complement these methods by enabling rapid, sensitive detection of predefined loci.
Table 2. Integration of genome sequencing into clinical diagnostic workflows *.
Table 2. Integration of genome sequencing into clinical diagnostic workflows *.
Clinical StageConventional ApproachSequencing-Enhanced ApproachAdvantages of SequencingChallenges
Pathogen identificationCulture, microscopy, antigen testsShotgun metagenomic sequencingCulture-independent, hypothesis-free, broad detectionRequires sufficient pathogen load; host DNA background
Resistance profilingPhenotypic AST (growth-based)Whole-genome sequencing (resistome analysis)Faster turnaround, multiple resistance determinants at onceNovel/unknown mechanisms may be missed; interpretation complexity
Outbreak investigationPFGE, MLST, serotypingWhole-genome sequencing (phylogenomics)Ultra-fine resolution enables transmission mappingRequires bioinformatics infrastructure; careful interpretation
Treatment decision-makingEmpirical therapy based on clinical suspicionGenomic-informed therapy (ID + AMR prediction)Enables precision prescribing, stewardship optimizationIntegration into clinical workflows, need for rapid turnaround
* Sequencing augments or replaces conventional methods at multiple stages of care, enabling culture-independent detection, rapid resistance prediction, high-resolution outbreak analysis, and more precise therapeutic guidance, while challenges remain in turnaround time, interpretation, and implementation.
Table 3. Representative tools and databases for genomic AMR prediction *.
Table 3. Representative tools and databases for genomic AMR prediction *.
Tool/DatabaseMethodologyDetectsKey FeaturesLimitations
ResFinderBLAST search against curated gene setsAcquired resistance genesWeb-based, easy-to-use, regularly updatedLimited to novel/rare mechanisms
CARD (RGI)BLAST + HMM modelsResistance genes, allelic variants, mutationsBroad ontology; includes efflux, enzymatic, and regulatory mechanismsRequires expertise to interpret outputs
AMRFinderPlusBLAST + HMMsGenes and key resistance mutationsIntegrated with NCBI pathogen pipelines; curated reference setsPrimarily bacterial pathogens; limited viral coverage
ARIBALocal de novo assembly + alignmentGenes and alleles (customizable)Flexible use with custom databases, suitable for local validationRequires high-quality assemblies; more complex setup
* Widely used platforms such as ResFinder, CARD, AMRFinderPlus, and ARIBA employ different computational strategies and curated datasets to identify acquired genes and resistance-associated mutations. Each provides complementary strengths for surveillance and clinical application.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hasan, G.M.; Mohammad, T.; Shamsi, A.; Sohal, S.S.; Hassan, M.I. Applications of Genome Sequencing in Infectious Diseases: From Pathogen Identification to Precision Medicine. Pharmaceuticals 2025, 18, 1687. https://doi.org/10.3390/ph18111687

AMA Style

Hasan GM, Mohammad T, Shamsi A, Sohal SS, Hassan MI. Applications of Genome Sequencing in Infectious Diseases: From Pathogen Identification to Precision Medicine. Pharmaceuticals. 2025; 18(11):1687. https://doi.org/10.3390/ph18111687

Chicago/Turabian Style

Hasan, Gulam Mustafa, Taj Mohammad, Anas Shamsi, Sukhwinder Singh Sohal, and Md. Imtaiyaz Hassan. 2025. "Applications of Genome Sequencing in Infectious Diseases: From Pathogen Identification to Precision Medicine" Pharmaceuticals 18, no. 11: 1687. https://doi.org/10.3390/ph18111687

APA Style

Hasan, G. M., Mohammad, T., Shamsi, A., Sohal, S. S., & Hassan, M. I. (2025). Applications of Genome Sequencing in Infectious Diseases: From Pathogen Identification to Precision Medicine. Pharmaceuticals, 18(11), 1687. https://doi.org/10.3390/ph18111687

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop