Next Article in Journal
Comprehensive Review of Silver Nanoparticles in Food Packaging Applications
Previous Article in Journal
Neurochemical Remodelling of the Enteric Nervous System Neurons in the Porcine Jejunum Following Low-Dose Glyphosate Exposure
Previous Article in Special Issue
N-Methylpyridinium Porphyrin Complexes as Sensitizers for Sonodynamic Therapy Against Planktonic and Biofilm-Forming Multidrug-Resistant Microbes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessment of Microbiome-Based Pathogen Detection Using Illumina Short-Read and Nanopore Long-Read Sequencing in 144 Patients Undergoing Bronchoalveolar Lavage in a University Hospital in Germany

1
Institute of Medical Microbiology, Medical Microbiome-Metagenome Unit (M3U), Justus Liebig University Giessen, 35392 Giessen, Germany
2
German Center for Infection Research (DZIF), Partner Site Giessen-Marburg-Langen, Justus Liebig University Giessen, 35392 Giessen, Germany
3
Institute of Medical Informatics, Justus Liebig University Giessen, 35392 Giessen, Germany
4
Department of Internal Medicine, Universities of Giessen and Marburg Lung Center (UGMLC), German Center for Lung Research (DZL), Justus Liebig University Giessen, 35392 Giessen, Germany
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2025, 26(20), 9841; https://doi.org/10.3390/ijms26209841
Submission received: 27 June 2025 / Revised: 21 September 2025 / Accepted: 27 September 2025 / Published: 10 October 2025
(This article belongs to the Collection Feature Papers in Molecular Microbiology)

Abstract

Lower respiratory tract infections (LRTIs) represent a significant global health concern, and the accurate identification of pathogens is crucial for patient care. Culture-based methods are the gold standard, but their detection abilities are limited. Next-generation sequencing (NGS) offers a promising method for comprehensive microbial detection, providing valuable information for clinical practice. In this study, 144 bronchoalveolar lavage fluid samples were collected, culture-based diagnostics were performed, and bacterial microbiome profiles were generated by short-read sequencing of the V4 region of the 16S rRNA gene using Illumina technologies and long-read sequencing with Oxford Nanopore Technologies (ONT) to determine the full-length 16S rRNA gene. The most common genera detected by NGS included Streptococcus, Staphylococcus, Veillonella, Prevotella, Rothia, Enterococcus, and Haemophilus. Short-read sequencing detected cultured bacteria at the genus level in ~85% of cases, while long-read sequencing demonstrated agreement with cultured species in ~62% of cases. In three cases, long-read sequencing identified the uncommon potential lung pathogen Tropheryma whipplei not detected with traditional culturing techniques. The NGS results showed a partial overlap with culture as the current diagnostic gold standard in LRTI. Additionally, NGS detected a broader spectrum of bacteria, revealed fastidious potential pathogens, and offered deeper insights into the complex microbial ecosystem of the lungs.

1. Introduction

The lung microbiome is one of the least populated microbiomes in the human body [1,2,3] and has not been explored as thoroughly as other bodily habitats such as the gut [4,5,6]. However, accumulating knowledge over the last decade has highlighted the (patho-)physiological importance of the lung microbiome in health and in acute and chronic disease [2,7]. The microbiome of the healthy lower respiratory tract is very dynamic and transient, with high bacterial diversity and low bacterial density [2,4,8]. In pneumonia, the local microbiome structure in the lung is scrambled, allowing one or a few pathogenic taxa to predominate [9,10]. The conventional linear model, in which a pathogen enters the lung and triggers an infection, has gradually been replaced by a more dynamic, multilayer concept of pathogenesis [10,11]. Moreover, the clear distinction between pathogenic and non-pathogenic organisms is increasingly being blurred, as common pathogens associated with lower respiratory tract infections (LRTIs), such as Streptococcus pneumoniae, are frequently present in the airways of asymptomatic individuals [6,10,12].
This paradigm shift has been considerably propelled by the introduction of next-generation sequencing (NGS) [2,9,10]. The combination of massively parallel sequencing techniques and bioinformatic data analysis has fundamentally transformed both understanding and research of the human microbiome [2,8,13]. While previous diagnostic techniques target a specific pathogen a priori, NGS embodies a broader approach by mapping the lung microbiome as a complex community [10].
LRTIs are a leading contributor to morbidity and mortality worldwide. In 2021, the global incidence of non-COVID-19 LRTI was estimated at 344 million episodes, resulting in 2.18 million deaths, primarily among vulnerable populations such as infants, the elderly, and those affected by air pollution or extreme poverty by air pollution or extreme poverty [14,15]. The precise and streamlined identification of causative agents in LRTI is crucial for effective patient care. However, pathogens often go undetected by the existing culture-based methods [16]. In their systematic review, Shoar and Musher demonstrated that in over half of the cases studied, conventional methods such as culturing and PCR failed to find an etiologic microbe for community-acquired pneumonia [17]. In addition, certain bacteria, such as Bartonella, Legionella, Mycobacterium, and other atypical bacteria, are challenging to cultivate [18,19,20].
NGS techniques offer a promising approach for improving pathogen detection in samples acquired from the respiratory tract and other parts of the body, as demonstrated by numerous studies [21,22,23,24,25,26,27,28]. Furthermore, they could provide valuable information for clinical practice, such as microbial diversity or DNA concentration as an indicator of exceeding microbial growth [29,30]. Overall, this could lead to more personalised patient care [31,32,33].
This study aimed to systematically compare the bacterial microbiome profiles of bronchoalveolar lavage fluid (BALF) samples using Illumina short-read sequencing of the V4 region of the 16S rRNA gene, long-read sequencing of the full-length 16S rRNA gene via Oxford Nanopore Technologies (ONT) and culture-based routine diagnostics, employing a rapid ONT sequencing workflow for pathogen detection.
Moreover, we assessed whether DNA concentration and α-diversity varied across samples that grew a potential pathogen, a commensal, or had no growth in the culture.

2. Results

From December 2020 to October 2021, 144 BALF samples from patients undergoing bronchoalveolar lavage at the University Hospital Giessen (Germany) were collected. These samples were cultured and reported within 24 to 48 h by the Institute of Medical Microbiology’s routine diagnostic unit (Supplementary Figure S1a). A total of 22 samples had to be excluded from this study due to there being an insufficient amount of fluid (12) or a positive test for SARS-CoV-2 (10). The remaining 122 samples were sequenced with Illumina short-read sequencing, resulting in 106 samples with at least 1400 sequence reads included in the analysis. The Illumina workflow for sample processing, sequencing, and analysis took approximately 32 h (Supplementary Figure S1b). As a final part of the study, 102 of the 122 included patient samples were sequenced with ONT long-read sequencing. A total of 20 samples had to be excluded due to insufficient remaining DNA extracts for further sequencing. In total, 82 BALF samples reached the minimum threshold of 20 classified sequence reads and were therefore included in the analysis. The microbiome profiles for these samples were generated within 8 h, which limited sequencing to two hours. A total of 82 BALF samples reached the minimum threshold of 20 classified sequence reads and were therefore included in the analysis. The microbiome profile for these samples was within 8 h, which limited sequencing to two hours (Supplementary Figure S1c). In this study, we focused on the 106 samples for which we obtained overlapping NGS and culture results (Figure 1).

2.1. Culture-Based Diagnostics

Bacterial growth was observed in 67 samples (63.2%), with a total of 28 species identified in culture. The most frequently reported taxa included α-haemolytic Streptococcus, saprophytic Neisseria, and Staphylococcus aureus. Bacteria detected by culture were assessed for pathogenicity according to the S3 guidelines for community- and hospital-acquired pneumonia, using additional sources when guideline information on specific species was unavailable (Supplementary Table S1), resulting in 9 of the 28 cultured species being classified as potentially pathogenic. In 26 samples (24.5%), at least one potential pathogen could be identified (Supplementary Table S2).

2.2. Results of Illumina 16S rRNA Gene V4 Region Sequencing

The 16S rRNA gene sequencing of the V4 region using Illumina short-read sequencing technology yielded a total of 685,459 sequence reads, with a median of 5314 sequence reads per sample (IQR, 3344–8471). The genera most frequently identified through Illumina sequencing included Streptococcus, Staphylococcus, Veillonella, Prevotella 7, Rothia, and Enterococcus. We categorised our microbiome profiles generated by Illumina short-read and ONT long-read sequencing into four groups (see Section 4.5 and Supplementary Table S3). A total of 13 BALF samples displayed a monomicrobial profile, while 30 samples showed a polymicrobial profile. Additionally, 45 BALF samples were characterised by a multi-microbial profile, and 18 samples were included in the low-biomass group (Supplementary Figure S2). The monomicrobial samples demonstrated the presence of the genera Staphylococcus (4), Pseudomonas (2), Stenotrophomonas (2), Enterococcus (2), Haemophilus (1), and Streptococcus (1), as well as bacteria of the order Micrococalles (1) (Figure 2, Supplementary Table S2).

2.3. Results of ONT 16S rRNA Gene Full-Length Sequencing

The ONT long-read sequencing of 102 samples produced 81 samples with a minimum sequence read count of 20, which were included in this analysis. For those 81 samples, a total of 398,144 sequence reads were generated, with a median of 329 sequence reads per sample (IQR, 109–1279). A total of 322 distinct species were observed, representing 144 genera. The three most abundant genera were Streptococcus, Veillonella, and Staphylococcus, which is in line with the Illumina results. In contrast to the culture results, Neisseria was not among the most abundant genera in the sequencing results. On the genus level, 25 samples were categorised as monomicrobial, 27 as polymicrobial, 19 samples showed a multi-microbial profile, and 10 samples were of the low-biomass category (Figure 3, Supplementary Figure S3). While most of the detected species were commensals of the upper respiratory tract, we detected one or more potential pathogens in 22 BALF samples (27.2% of the included BALF samples). Three potential pathogens identified using ONT long-read sequencing were not observed in culture, namely, Listeria monocytogenes, Streptococcus pseudopneumoniae, and Tropheryma whipplei (Supplementary Table S2).

2.4. DNA Yield of BALF Samples

We categorised our microbiome profiles generated by Illumina short-read and ONT long-read sequencing into four groups (see Section 4.5 and Supplementary Table S3). The DNA yield in samples in which a potential pathogen was cultivated was significantly higher compared to samples with either negative culture results or those containing commensal bacteria (Figure 4a). In samples sequenced by short-read sequencing, the Shannon index indicated a lower diversity in samples containing potential pathogens in culture compared to samples with commensals (Figure 4b), suggesting the presence of a few dominant bacteria.
The DNA yield of the monomicrobial classified samples was the highest in short-read and long-read sequencing, followed by the polymicrobial samples, indicating that the DNA yield is the highest in samples where a small number of bacterial species are dominant. The low-biomass samples, along with the multi-microbial samples, produced the lowest DNA yield, which aligns with the observation that contaminations were predominant in these types of samples (Figure 4c,d).

2.5. Validation of Culture-Based Diagnostics with NGS

To investigate to what extent culture results match NGS data, Table 1 was established. It shows the species detected in the culture alongside their matching species via ONT sequencing and their genera via Illumina short-read sequencing. This study generated one to four culture results for 67 of the 106 BALF samples, leading to a total of 118 positive culture reports. Because of the lower number of ONT-sequenced samples included in this study, long-read sequencing covers only 50 culture-positive samples with 91 positive culture reports.
A corresponding genus was identified in 100 out of 118 cases (84.7%) by short-read sequencing. In 51 cases (43.2%), these genera showed a relative abundance of 10% or more. All 35 samples with a culture report of α-haemolytic Streptococcus had a corresponding genus in the Illumina short-read sequencing data, albeit at varying ranks. For S. aureus and coagulase-negative Staphylococcus, the corresponding genus was missing in 35% of cases. Among the 27 reports of potential pathogens, 9 lacked a corresponding genus in short-read sequencing (33.3%). Neither of the two Citrobacter koseri reports, nor the Serratia marcescens report from routine diagnostics, could be validated with Illumina 16S rRNA gene sequencing data. Furthermore, two reports for Escherichia coli could not be confirmed because of the presence of this species in the negative controls (Table 1, Supplementary Table S2).
A total of 56 of the 91 culture-positive reports corresponding to the long-read-sequenced samples matched on species level (61.5%); however, 8 of these reports were verified with fewer than 10 sequence reads. For α-haemolytic Streptococcus, 24 of 26 reports were covered by ONT long-read sequencing (92.2%). Coagulase-negative Staphylococci were not detected by long-read sequencing in two cases. The S. aureus determination was limited to three out of eight culture reports (37.5%). In contrast to short-read sequencing, both reports of C. koseri were also confirmed by long-read NGS. The detection of E. coli was not feasible due to the overlap with the negative controls. The cultured species Neisseria flava could not be retraced by Nanopore sequencing; however, Neisseria flavescens and Neisseria perflava were sequenced in these two samples instead. In a similar fashion, the third Stenotrophomonas maltophilia was detected as Stenotrophomonas sp. MYb57 (Table 1, Supplementary Table S2).

2.6. Expanding Bacterial Pathogen Detection by NGS

The limited resolution of Illumina short-read sequencing of the V4 region of the 16S rRNA gene makes the differentiation between pathogens and commensals of the same species impossible. in many cases. The uncommon potential pathogen T. whipplei, not found by routine culture diagnostics, was an exception to this and was detected by short-read NGS with an extended classification using BLASTn [34].
ONT long-read 16S rRNA gene full-length sequencing uncovered potential pathogens in nine BALF samples not detected by culture. In seven of those nine cases, we found a potential pathogenic species that was not detected in any culture result (L. monocytogenes, Prevotella oris, Streptococcus pseudopneumoniae, T. whipplei, and Streptococcus agalactiae). Four of those nine culture-negative cases could not be confirmed by Illumina short-read sequencing on genus level. These were Haemophilus influenzae (1), L. monocytogenes (2), and S. aureus (1) (Figure 5). In three cases, the potential pathogens detected in the culture and via long-read sequencing did not match (Supplementary Table S2).

3. Discussion

NGS has become a key driver of advancements in microbiological research and holds significant promise for pathogen detection in the future. In this study, we showed that the most frequently detected genera by targeted 16S rRNA gene sequencing from patient BALF samples were Streptococcus, Staphylococcus, and Veillonella, which aligns with findings from several previous studies [8,35,36].
A total of 31 of the 82 BALF samples sequenced with Illumina and ONT agreed on the most abundant genera, which could be due to the different 16S rRNA gene primers used in the two sequencing approaches. Additional factors, such as genomic DNA extraction, the library construction, the sequencing platform, and the bioinformatic processing workflow, can influence the sequencing results and pathogen detection [31,37,38,39,40]. When aiming for a future routine application of NGS, this has to be taken into account, and standardised protocols have to be established. A combination of different NGS techniques or NGS and culture-based diagnostics can harden a diagnostic strategy against single points of failure, such as incomplete primer coverage, low sequencing depth, or unculturable bacteria.
To systematically compare culture results with Illumina and ONT sequencing, Table 1 was generated, which illustrates the overlap between the species identified in culture and the taxa detected by NGS. The detection rate of NGS (Illumina: 84,7%; ONT: 61,5%) was lower than in similar studies based on Illumina, such as Zachariah et al. [28], and Yoo et al. [27], or studies based on ONT, such as Lao et al. [41]. False negatives-negative NGS results may be attributed to factors such as low microbial load [27], incomplete primer coverage, and limited sequencing depth. However, false-positive culture results may also be possible.
Four samples revealed a different pathogen in ONT long-read sequencing than that reported by culture-based diagnostics. Five possible pathogens detected by culture had a relative abundance of under 3% in the microbiome profiles, which could diminish the pathogen detection ability of Illumina and ONT sequencing.
However, in several samples, genera of clinical significance, such as Haemophilus and Staphylococcus, were identified through Illumina sequencing even though culture results were negative. Notably, genus-level resolution is insufficient to distinguish commensals from pathogens such as S. aureus from S. epidermidis, highlighting the critical need for species-level identification in a clinical setting. Therefore, ONT 16S rRNA full-length gene sequencing was conducted for higher taxonomic resolution. In nine cases, ONT sequencing detected the following potential pathogens that were absent in the culture: H. influenzae (1), L. monocytogenes (2), P. oris (1), S. aureus (1), S. pseudopneumoniae (1), T. whipplei (2), and a coinfection with S. pseudopneumoniae and S. agalactiae (1). T. whipplei was also confirmed by Illumina sequencing. The causative agent of Whipple’s disease typically affects the gastrointestinal tract. Still, colonisation or infection of the lung were observed in a limited number of cases [42]. L. monocytogenes cases were only detected by ONT long-read sequencing. This bacterium can cause severe and life-threatening diseases, which generally manifest as bacteraemia and/or meningitis in neonates and elderly and immunocompromised patients; however, pulmonary listeriosis in adults has also been described [43].
The microbiome profiles offered further qualitative and quantitative insights into the bacterial composition, also labelled as “culture-independent indices of infection” [25]. In this study, the DNA yield, reflecting the DNA concentration in BALF extracts, was significantly higher in samples with a potential pathogen in culture (Figure 4a). This observation is consistent with a study by Dickson et al. [25]. However, individual values could not reliably be used as a single discriminator to determine whether a potential infection might be present. Concerning diversity indices, several studies have linked lower diversity indices to pathogen isolation in culture [23,44,45], which could only be partially reproduced by our study (Figure 4b).
The division into multi-microbial, based on the frequency distribution of the bacteria, aimed to place the NGS results in a broader context and to attain more generalisable findings. In addition, it was used for a structured comparison with the culture results. We indicate these profiling parameters in Figure 4c. However, the classification is somewhat artificial, as it oversimplifies the lung microbiome’s complex composition. The division into categories (mono-/poly-/multi-microbial and low-biomass) based on the frequency distribution of the bacteria aimed to place the NGS results in a broader context and to attain more generalisable findings. In addition, it was used for a structured comparison with the culture results. We assumed that the monomicrobial profiles, dominated by one single taxon, indicate a disruption in the microbial balance and therefore severe dysbiosis. Figure 4c,d shows that monomicrobial samples exhibited higher DNA concentrations than the multi-microbial or low-biomass ones, suggesting that these parameters could further aid in evaluating the condition of the lung microbiome. However, the classification is somewhat artificial as it oversimplifies the lung microbiome’s complex composition.
Culture-based standard diagnostics delivered results within 48 h compared to Illumina sequencing with 32 h. The ONT approach provided accurate results within one clinical working day by accepting a reduced sequence read depth in this study, which facilitates a rapid diagnostic turnaround time and enables early therapeutic intervention.
While 16S rRNA gene amplicon sequencing can identify potential pathogens, culture-based diagnosis also facilitates a deeper phenotypical classification, including the detection of antibiotic resistance and the potential virulence of culturable bacteria.
Overall, our findings align with those of previous research, reinforcing that NGS cannot currently completely replace classical bacterial culture techniques as the diagnostic gold standard for microbial detection in LRTI as relevant discrepancies between culture-based methods and NGS remain. However, NGS techniques provide a wealth of data. With increasing knowledge, the improvement of workflows, and decreasing costs, the establishment of NGS gradually comes within reach [22,39,46]. It could prove to be a valuable addition, improving the limited bacterial detection rate of culture, especially with respect to the methods reaching species-level resolution, such as ONT sequencing [27,47,48]. Additionally, NGS can come into play in complex cases, such as in patients pretreated with antibiotics [23,27]. With its fast turnaround time, ONT could play an important role as a point-of-care instrument. Additionally, features in the microbiome (culture-independent indices of infection) could serve as diagnostic or prognostic biomarkers to guide concrete clinical decisions [29,49].
Microbiome studies, including this investigation, have inherent strengths and limitations that influence their findings and interpretations. Our study benefited from a large patient cohort. We did not include a healthy control group; however, the primary objective of this study was to compare the different methods rather than to investigate specific microbiome characteristics associated with disease. A common challenge of 16S rRNA gene sequencing, especially in lower respiratory tract samples, is the high risk of contamination during laboratory procedures [2,9,31,50], which was also observed in our study. To address this, we included serval isolation, no-template, saline solution, and bronchoscope rinse controls. This study employed an observational, cross-sectional design, a common approach in respiratory microbiome research [8,31]; however, longitudinal studies are needed to address questions about causation or the principal (patho-)mechanisms [2,8,31,51]. Additionally, geographical, environmental, and ethnic variations should be investigated further, as current knowledge relies on research populations primarily from more highly developed countries, introducing potential biases [13,52,53].

4. Materials and Methods

4.1. Sample Acquisition and Culture-Based Diagnostics

Bronchoalveolar lavage was carried out in 144 patients as part of the clinical patient care at the University Hospital Giessen and Marburg, Giessen (UKGM Giessen, Germany). A total of 50 µL BALF samples were inoculated onto six distinct agar plates (MacConkey agar No. 3; Columbia agar with sheep blood PLUS; chocolate agar with Vitox; Schaedler anaerobe KV selective agar with lysed horse blood; Schaedler anaerobe agar with sheep blood, haemin, and vitamin K1; and Sabouraud–gentamicin–chloramphenicol 2 agar) each, incubated as recommended by the manufacturer (Thermo Fisher Scientific, Waltham, MA, USA; bioMérieux SA, Marcy-l’Étoile, France). Species identification was performed after 24 to 48 h using MALDI-TOF (MS Prime, bioMérieux SA, Marcy-l’Étoile, France), following quality-controlled standard diagnostic protocols at the Institute of Medical Microbiology diagnostics department. Subsequently, the remaining sample material was stored at −80 °C for NGS. The cultured bacteria of 106 BALF samples were assessed for potential pathogens (Supplementary Table S1).

4.2. DNA Extraction

DNA extraction for 122 BALF samples was performed using the automated extraction platform for nucleic acid purification EMAG (bioMérieux SA, Marcy-l’Étoile, France) following the manufacturer’s instructions. In brief, 250 µL of each sample were combined with Proteinase K to lyse cellular material, viral particles, bacteria, and fungi. Nucleases were inactivated using lysis buffer. Nucleic acids were isolated and purified via magnetic silica particles and wash buffers and eluted in elution buffer. Genomic DNA (gDNA) was then quantified using the Quant-iT PicoGreen dsDNA Assay Kit (Invitrogen, Carlsbad, CA, USA) and the Qubit High Sensitivity Kit (Invitrogen, Carlsbad, CA, USA).

4.3. Control of Contaminations

To monitor contaminations and to verify laboratory processes, isolation and no-template controls were included for each processed batch of samples. To assess contamination during the lavages, four bronchoscopy rinse controls and four saline solution controls of the flush solution were gathered over four days and sequenced.
Of the controls included in the Illumina short-read samples, all the bronchoscopy rinse controls and saline solutions controls were not further analysed because of the low number of usable reads contained in those samples. The remaining controls contained between 550 and 7824 sequence reads and were employed to determine possible contaminations (Supplementary Figure S4).
ONT long-read sequencing yielded reads in two no-template controls, with 89 and 192 sequence reads, respectively, and an isolation control with 383 sequence reads; otherwise, the controls were below 20 classified sequence reads and were excluded (Supplementary Figure S5).

4.4. Short-Read Sequencing with Illumina

Library preparation for Illumina short-read sequencing was executed according to [54]. Pipetting steps were automated with the Hamilton Microlab STAR (HAMILTON Bonaduz AG, Bonaduz, Switzerland). Amplification of the V4-region of the 16S rRNA gene was performed with 10 µL of input material and 30 cycles for the PCR (2 min at 98 °C, 30 cycles with 10 s at 98 °C, 10 s at 55 °C and 30 s at 72 °C, plus a final step for 5 min at 72 °C). A total of 5 µL of each library was further amplified with a custom set of 10 µmol unique dual indices (Integrated DNA Technologies, Coralville, IA, USA) and NEBNext Ultra II Q5 Master Mix (New England Biolabs, Ipswich, MA, USA) according to the manufacturer’s protocol for NGS PCR to facilitate multiplex sequencing.
Both amplicons and libraries were purified with AMPure XP Reagent (Beckman Coulter, Pasadena, CA, USA) and eluted in Ambion nuclease-free water (Life Technologies, Carlsbad, CA, USA). The quantity and quality of the libraries were assessed with the Fragment Analyzer (Agilent Technologies Inc., Santa Clara, CA, USA), qPCR (Applied Biosystems, Waltham, MA, USA), and the 2100 Bioanalyzer (Agilent Technologies Inc., Santa Clara, CA, USA). Libraries were pooled in equimolar ratios and diluted to a final concentration of 4 nM. Paired-end reads were sequenced on the MiSeq system (Illumina, San Diego, CA, USA) using the MiSeq Reagent Nano Kit v2 (500-cycles) (Illumina, San Diego, CA, USA) and 20% of PhiX Control v3 (Illumina, San Diego, CA, USA). The generated reads were image processed, base called, and demultiplexed prior to analysis.

4.5. Bioinformatic Analysis of Illumina Short-Read Sequencing Data

Microbiome analysis was conducted using Mothur (version 1.48.3) [55]. Paired-end reads were merged, and primer regions were removed and filtered to retain an amplicon length of 253 nt ± 10 nt. Sequence reads with ambiguous nucleotides were excluded. Joined paired-end sequence reads were aligned to the SILVA ribosomal RNA gene database (version 138.2) [56] and trimmed to include only the hypervariable region V4. Clustering was performed with a 97% similarity threshold. After chimaeras were removed using VSEARCH (version 2.17.1) [57], operational taxonomic units (OTUs) were determined and classified using the SILVA ribosomal RNA gene database. For downstream analysis, all samples were subsampled to 1.400 sequence reads, and 16 samples with less than 1400 sequence reads were excluded. Mothur was used to calculate α-diversity indices. The representative sequence for each OTU was further analysed by BLASTn (version 2.12.0) [34] against the 16S ribosomal RNA database from the NCBI RefSeq Targeted Loci Project [58]. A total of 32 OTUs that overlapped with the negative controls were flagged as contaminations and removed from further analysis (Supplementary Table S4). Finally, samples were classified as monomicrobial (a single genus above 50%), polymicrobial (maximal three genera representing 50%), and multi-microbial (any other composition above 20%). Samples with less than 20% of the sequence reads remaining after contamination removal were categorised as low-biomass samples.

4.6. Long-Read Sequencing with Oxford Nanopore Technologies

To amplify the entire 16S rRNA gene from the remaining extracted gDNA of the 102 BALF samples, the 16S Barcoding Kit 24 V14 was used according to the manufacturer’s protocol (Oxford Nanopore Technologies, Oxford, UK). For samples with a DNA yield lower than recommended, 15 µL were used as input for the PCR. The libraries before and the after pool normalisation in equimolar ratios were quantified with the Qubit High Sensitivity Kit (Invitrogen, Carlsbad, CA, USA). Samples were sequenced on the MinION Mk1B (Oxford Nanopore Technologies, Oxford, UK) with corresponding MinION Flow Cells (Oxford Nanopore Technologies, Oxford, UK). Between sequencing steps, the flow cell was washed and stored according to instructions for the Flow Cell Wash Kit (Oxford Nanopore Technologies, Oxford, UK). All reagents and flow cells used were compatible with the current chemistry type R10.4.1.

4.7. Bioinformatic Analysis of ONT Long-Read Sequencing Data

To assess the quality of long-read sequencing data, fastplong (version 0.3.0) was employed [59]. Sequence reads were filtered for a minimum mean quality of Q20 and a minimum read length of 1000 nucleotides. A total of 21 samples with less than 20 sequence reads remaining after quality control were excluded from this study. Detected adapter and primer sequences were removed. Emu (version 3.5.1) [60] was used to classify the resulting reads to species-level with a combination of the rrnDB [61] and 16S RefSeq records from the targeted loci project [58] as reference database. Reads with the classification Delftia acidovorans, Escherichia coli, Paracoccus angustae, and Paracoccus marinus were removed as possible contaminations. Samples were classified as mono-/poly-/multi-microbial and low-biomass, as described for the analyses of Illumina short-read sequencing.

4.8. Comparitive Microbiome Analysis

DNA burden and diversity indices were evaluated alongside the culture results. The overlap between the findings from culture-based diagnostics and Illumina short-read sequencing was then assessed in two ways. The first approach involved creating a table to match the species identified in the culture with their respective genera, resembling the analysis conducted by Zachariah et al. The second approach categorised microbiome profiles into four groups (mono-/poly-/multi-microbial and low-biomass) and compared the results to culture outcomes. Subsequently, the microbiome profiles of the full-length 16S rRNA gene of the selected samples were analysed at the species level. Lastly, results from the culture, Illumina short-read, and ONT long-read sequencing were juxtaposed.

4.9. Statistic Evaluation

Exploratory data analysis was performed. Statistical significance was set at p < 0.05, with a threshold of p < 0.1 for preliminary tests (e.g., normality tests). The Kruskal–Wallis test was used to compare three or more independent groups. Dunn’s post hoc test with Bonferroni correction was applied for multiple comparisons. Data were log-transformed (base 2) to stabilise variance. Statistical analyses and figures were generated using GraphPad Prism (version 10.2.3).

5. Conclusions

Next-generation sequencing is a promising tool for LRTI diagnostics in clinical microbiology. First, advancing the speed and efficiency of NGS protocols is crucial to facilitate real-time diagnostics, which would enable clinicians to make quicker decisions in acute care settings. Additionally, leveraging NGS to explore the human microbiome could yield insights into disease susceptibility and reveal novel pathogen–host interactions, potentially leading to the identification of previously unknown infectious agents. Furthermore, utilising NGS to monitor and understand antimicrobial resistance—e.g., applying metagenomic approaches—could significantly enhance our ability to track resistance patterns and inform more effective treatment strategies and public health policies. Personalised medicine stands to benefit greatly from NGS by using individual genetic and LRTI microbiome profiles to tailor treatments, offering precision diagnostics that account for personal variability in disease response. Improving accessibility and reducing the cost of NGS technology is also vital, making it feasible for widespread use. Finally, collaborative efforts across disciplines—including genomics, bioinformatics, and clinical medicine—are essential to translate these research advancements into practical and impactful clinical applications.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms26209841/s1: References [62,63,64,65,66,67,68,69,70,71] are cited in the supplementary materials.

Author Contributions

Conceptualisation, C.I. and T.H.; methodology, M.B., J.P.M., B.O., M.W., K.T. and T.H.; software, M.W.; validation, M.B., M.W., A.C.W., K.T. and T.H.; formal analysis, M.B., M.W. and T.H.; investigation, M.B., M.W., B.O., C.I. and T.H.; resources, K.T., C.I. and T.H.; data curation, M.B., M.W. and T.H.; writing—original draft preparation, M.B., M.W. and T.H.; writing—review and editing, M.B., M.W., J.P.M., B.O., A.C.W., C.I. and T.H.; visualisation, M.B., B.O. and M.W.; supervision, C.I. and T.H.; project administration, M.B., M.W., C.I. and T.H.; funding, M.B., M.W. and T.H. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by grants from the German Research Foundation (DFG, SFB-TR84 B08) to M.W. and T.H., from the HMWK LOEWE Research Cluster Diffusible Signals project B3 to T.H., and from the JLU TRAINEE of the Faculty of Medicine at the Justus Liebig University Giessen programme to. M.B.

Institutional Review Board Statement

This study’s ethical approval was obtained from the ethics committee of the Justus Liebig University Giessen, Faculty of Medicine (AZ 158/20, 21 August 2020).

Informed Consent Statement

Patient consent was waived due to following reason: The samples collected were solely for clinical care purposes. All patient data remains anonymized, thereby negating the need for ethics approval, as the tasks associated with these surplus samples are integral to improving clinical microbial diagnostics.

Data Availability Statement

Microbiome sequencing data have been submitted to the NCBI BioProject database under the accession number PRJNA1276981 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1276981, accessed on 26 September 2025).

Acknowledgments

We thank Alexandra Amend for the technical assistance.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BALFBronchoalveolar lavage fluid
IQRInterquartile range
LRTILower respiratory tract infection
NGSNext-generation sequencing
ONTOxford Nanopore Technologies
OTUOperational taxonomic unit

References

  1. Marsland, B.J.; Trompette, A.; Gollwitzer, E.S. The Gut–Lung Axis in Respiratory Disease. Ann. Am. Thorac. Soc. 2015, 12, S150–S156. [Google Scholar] [CrossRef]
  2. Natalini, J.G.; Singh, S.; Segal, L.N. The Dynamic Lung Microbiome in Health and Disease. Nat. Rev. Microbiol. 2023, 21, 222–235. [Google Scholar] [CrossRef]
  3. Beck, J.M.; Young, V.B.; Huffnagle, G.B. The Microbiome of the Lung. Transl. Res. 2012, 160, 258–266. [Google Scholar] [CrossRef] [PubMed]
  4. Fromentin, M.; Ricard, J.-D.; Roux, D. Respiratory Microbiome in Mechanically Ventilated Patients: A Narrative Review. Intensive Care Med. 2021, 47, 292–306. [Google Scholar] [CrossRef] [PubMed]
  5. Huffnagle, G.B.; Dickson, R.P.; Lukacs, N.W. The Respiratory Tract Microbiome and Lung Inflammation: A Two-Way Street. Mucosal Immunol. 2017, 10, 299–306. [Google Scholar] [CrossRef] [PubMed]
  6. Mendez, R.; Banerjee, S.; Bhattacharya, S.K.; Banerjee, S. Lung Inflammation and Disease: A Perspective on Microbial Homeostasis and Metabolism. IUBMB Life 2019, 71, 152–165. [Google Scholar] [CrossRef]
  7. Moffatt, M.F.; Cookson, W.O. The Lung Microbiome in Health and Disease. Clin. Med. 2017, 17, 525–529. [Google Scholar] [CrossRef]
  8. Yagi, K.; Huffnagle, G.B.; Lukacs, N.W.; Asai, N. The Lung Microbiome during Health and Disease. Int. J. Mol. Sci. 2021, 22, 10872. [Google Scholar] [CrossRef]
  9. Martin-Loeches, I.; Dickson, R.; Torres, A.; Hanberger, H.; Lipman, J.; Antonelli, M.; de Pascale, G.; Bozza, F.; Vincent, J.L.; Murthy, S.; et al. The Importance of Airway and Lung Microbiome in the Critically Ill. Crit. Care 2020, 24, 537. [Google Scholar] [CrossRef]
  10. Wu, B.G.; Segal, L.N. The Lung Microbiome and Its Role in Pneumonia. Clin. Chest Med. 2018, 39, 677–689. [Google Scholar] [CrossRef]
  11. Dickson, R.P.; Erb-Downward, J.R.; Martinez, F.J.; Huffnagle, G.B. The Microbiome and the Respiratory Tract. Annu. Rev. Physiol. 2016, 78, 481–504. [Google Scholar] [CrossRef] [PubMed]
  12. Molyneaux, P.L.; Mallia, P.; Cox, M.J.; Footitt, J.; Willis-Owen, S.A.G.; Homola, D.; Trujillo-Torralbo, M.-B.; Elkin, S.; Kon, O.M.; Cookson, W.O.C.; et al. Outgrowth of the Bacterial Airway Microbiome after Rhinovirus Exacerbation of Chronic Obstructive Pulmonary Disease. Am. J. Respir. Crit. Care Med. 2013, 188, 1224. [Google Scholar] [CrossRef] [PubMed]
  13. Man, W.H.; de Steenhuijsen Piters, W.A.A.; Bogaert, D. The Microbiota of the Respiratory Tract: Gatekeeper to Respiratory Health. Nat. Rev. Microbiol. 2017, 15, 259–270. [Google Scholar] [CrossRef]
  14. GBD 2016 Lower Respiratory Infections Collaborators Estimates of the Global, Regional, and National Morbidity, Mortality, and Aetiologies of Lower Respiratory Infections in 195 Countries, 1990–2016: A Systematic Analysis for the Global Burden of Disease Study 2016. Lancet Infect. Dis. 2018, 18, 1191–1210. [CrossRef]
  15. Bender, R.G.; Sirota, S.B.; Swetschinski, L.R.; Dominguez, R.-M.V.; Novotney, A.; Wool, E.E.; Ikuta, K.S.; Vongpradith, A.; Rogowski, E.L.B.; Doxey, M.; et al. Global, Regional, and National Incidence and Mortality Burden of Non-COVID-19 Lower Respiratory Infections and Aetiologies, 1990–2021: A Systematic Analysis from the Global Burden of Disease Study 2021. Lancet Infect. Dis. 2024, 24, 974–1002. [Google Scholar] [CrossRef]
  16. Kalantar, K.L.; Langelier, C.R. Host-Microbe Metagenomics: A Lens To Refocus Our Perspective on Infectious and Inflammatory Diseases. mSystems 2021, 6, e0040421. [Google Scholar] [CrossRef]
  17. Shoar, S.; Musher, D.M. Etiology of Community-Acquired Pneumonia in Adults: A Systematic Review. Pneumonia 2020, 12, 11. [Google Scholar] [CrossRef]
  18. Gu, W.; Deng, X.; Lee, M.; Sucu, Y.D.; Arevalo, S.; Stryke, D.; Federman, S.; Gopez, A.; Reyes, K.; Zorn, K.; et al. Rapid Pathogen Detection by Metagenomic Next-Generation Sequencing of Infected Body Fluids. Nat. Med. 2021, 27, 115–124. [Google Scholar] [CrossRef]
  19. Zhu, N.; Zhou, D.; Yuan, R.; Ruzetuoheti, Y.; Li, J.; Zhang, X.; Li, S. Identification and Comparison of Chlamydia Psittaci, Legionella and Mycoplasma Pneumonia Infection. Clin. Respir. J. 2023, 17, 384–393. [Google Scholar] [CrossRef]
  20. Laboratory Diagnosis of Infectious Diseases. Available online: https://accessmedicine.mhmedical.com/content.aspx?sectionid=265477006&bookid=3095 (accessed on 10 September 2025).
  21. Bador, J.; Nicolas, B.; Chapuis, A.; Varin, V.; Dullier-Taillefumier, N.; de Curraize, C.; Amoureux, L.; Putot, A.; Neuwirth, C. 16S rRNA PCR on Clinical Specimens: Impact on Diagnosis and Therapeutic Management. Med. Mal. Infect. 2020, 50, 63–73. [Google Scholar] [CrossRef] [PubMed]
  22. Charalampous, T.; Kay, G.L.; Richardson, H.; Aydin, A.; Baldan, R.; Jeanes, C.; Rae, D.; Grundy, S.; Turner, D.J.; Wain, J.; et al. Nanopore Metagenomics Enables Rapid Clinical Diagnosis of Bacterial Lower Respiratory Infection. Nat. Biotechnol. 2019, 37, 783–792. [Google Scholar] [CrossRef]
  23. Kitsios, G.D. Translating Lung Microbiome Profiles into the Next-Generation Diagnostic Gold Standard for Pneumonia: A Clinical Investigator’s Perspective. mSystems 2018, 3, e00153-17. [Google Scholar] [CrossRef]
  24. Mahnic, A.; Breznik, V.; Bombek Ihan, M.; Rupnik, M. Comparison Between Cultivation and Sequencing Based Approaches for Microbiota Analysis in Swabs and Biopsies of Chronic Wounds. Front. Med. 2021, 8, 607255. [Google Scholar] [CrossRef] [PubMed]
  25. Dickson, R.P.; Erb-Downward, J.R.; Prescott, H.C.; Martinez, F.J.; Curtis, J.L.; Lama, V.N.; Huffnagle, G.B. Analysis of Culture-Dependent versus Culture-Independent Techniques for Identification of Bacteria in Clinically Obtained Bronchoalveolar Lavage Fluid. J. Clin. Microbiol. 2014, 52, 3605–3613. [Google Scholar] [CrossRef] [PubMed]
  26. Yang, J.; Li, J.; Zhang, L.; Shen, Z.; Xiao, Y.; Zhang, G.; Chen, M.; Chen, F.; Liu, L.; Wang, Y.; et al. Highly Diverse Sputum Microbiota Correlates with the Disease Severity in Patients with Community-Acquired Pneumonia: A Longitudinal Cohort Study. Respir. Res. 2024, 25, 223. [Google Scholar] [CrossRef] [PubMed]
  27. Yoo, I.Y.; Kang, O.K.; Lee, M.K.; Kim, Y.J.; Cho, S.Y.; Huh, K.; Kang, C.I.; Chung, D.R.; Peck, K.R.; Huh, H.J.; et al. Comparison of 16S Ribosomal RNA Targeted Sequencing and Culture for Bacterial Identification in Normally Sterile Body Fluid Samples: Report of a 10-Year Clinical Laboratory Review. Ann. Lab. Med. 2020, 40, 63–67. [Google Scholar] [CrossRef]
  28. Zachariah, P.; Ryan, C.; Nadimpalli, S.; Coscia, G.; Kolb, M.; Smith, H.; Foca, M.; Saiman, L.; Planet, P.J. Culture-Independent Analysis of Pediatric Bronchoalveolar Lavage Specimens. Ann. Am. Thorac. Soc. 2018, 15, 1047–1056. [Google Scholar] [CrossRef]
  29. Dickson, R.P.; Schultz, M.J.; van der Poll, T.; Schouten, L.R.; Falkowski, N.R.; Luth, J.E.; Sjoding, M.W.; Brown, C.A.; Chanderraj, R.; Huffnagle, G.B.; et al. Lung Microbiota Predict Clinical Outcomes in Critically Ill Patients. Am. J. Respir. Crit. Care Med. 2020, 201, 555–563. [Google Scholar] [CrossRef]
  30. Li, Y.; Sun, B.; Tang, X.; Liu, Y.-L.; He, H.-Y.; Li, X.-Y.; Wang, R.; Guo, F.; Tong, Z.-H. Application of Metagenomic Next-Generation Sequencing for Bronchoalveolar Lavage Diagnostics in Critically Ill Patients. Eur. J. Clin. Microbiol. Infect. Dis. 2020, 39, 369–374. [Google Scholar] [CrossRef]
  31. Carney, S.M.; Clemente, J.C.; Cox, M.J.; Dickson, R.P.; Huang, Y.J.; Kitsios, G.D.; Kloepfer, K.M.; Leung, J.M.; LeVan, T.D.; Molyneaux, P.L.; et al. Methods in Lung Microbiome Research. Am. J. Respir. Cell Mol. Biol. 2020, 62, 283–299. [Google Scholar] [CrossRef]
  32. Wang, J.; Han, Y.; Feng, J. Metagenomic Next-Generation Sequencing for Mixed Pulmonary Infection Diagnosis. BMC Pulm. Med. 2019, 19, 252. [Google Scholar] [CrossRef] [PubMed]
  33. Xie, F.; Duan, Z.; Zeng, W.; Xie, S.; Xie, M.; Fu, H.; Ye, Q.; Xu, T.; Xie, L. Clinical Metagenomics Assessments Improve Diagnosis and Outcomes in Community-Acquired Pneumonia. BMC Infect. Dis. 2021, 21, 352. [Google Scholar] [CrossRef]
  34. Altschul, S.F.; Madden, T.L.; Schäffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs. Nucleic Acids Res. 1997, 25, 3389–3402. [Google Scholar] [CrossRef] [PubMed]
  35. Dickson, R.P.; Erb-Downward, J.R.; Freeman, C.M.; McCloskey, L.; Falkowski, N.R.; Huffnagle, G.B.; Curtis, J.L. Bacterial Topography of the Healthy Human Lower Respiratory Tract. mBio 2017, 8, e02287-16. [Google Scholar] [CrossRef] [PubMed]
  36. Wypych, T.P.; Wickramasinghe, L.C.; Marsland, B.J. The Influence of the Microbiome on Respiratory Health. Nat. Immunol. 2019, 20, 1279–1290. [Google Scholar] [CrossRef]
  37. Gill, C.; van de Wijgert, J.H.H.M.; Blow, F.; Darby, A.C. Evaluation of Lysis Methods for the Extraction of Bacterial DNA for Analysis of the Vaginal Microbiota. PLoS ONE 2016, 11, e0163148. [Google Scholar] [CrossRef]
  38. Knight, R.; Vrbanac, A.; Taylor, B.C.; Aksenov, A.; Callewaert, C.; Debelius, J.; Gonzalez, A.; Kosciolek, T.; McCall, L.-I.; McDonald, D.; et al. Best Practices for Analysing Microbiomes. Nat. Rev. Microbiol. 2018, 16, 410–422. [Google Scholar] [CrossRef]
  39. Allaband, C.; McDonald, D.; Vázquez-Baeza, Y.; Minich, J.J.; Tripathi, A.; Brenner, D.A.; Loomba, R.; Smarr, L.; Sandborn, W.J.; Schnabl, B.; et al. Microbiome 101: Studying, Analyzing, and Interpreting Gut Microbiome Data for Clinicians. Clin. Gastroenterol. Hepatol. 2019, 17, 218–230. [Google Scholar] [CrossRef]
  40. Drengenes, C.; Eagan, T.M.L.; Haaland, I.; Wiker, H.G.; Nielsen, R. Exploring Protocol Bias in Airway Microbiome Studies: One versus Two PCR Steps and 16S rRNA Gene Region V3 V4 versus V4. BMC Genom. 2021, 22, 3. [Google Scholar] [CrossRef]
  41. Lao, H.-Y.; Wong, L.L.-Y.; Hui, Y.; Ng, T.T.-L.; Chan, C.T.-M.; Lo, H.W.-H.; Yau, M.C.-Y.; Leung, E.C.-M.; Wong, R.C.-W.; Ho, A.Y.-M.; et al. The Clinical Utility of Nanopore 16S rRNA Gene Sequencing for Direct Bacterial Identification in Normally Sterile Body Fluids. Front. Microbiol. 2024, 14, 1324494. [Google Scholar] [CrossRef]
  42. Lai, L.M.; Zhu, X.Y.; Zhao, R.; Chen, Q.; Liu, J.J.; Liu, Y.; Yuan, L. Tropheryma Whipplei Detected by Metagenomic Next-Generation Sequencing in Bronchoalveolar Lavage Fluid. Diagn. Microbiol. Infect. Dis. 2024, 109, 116374, Erratum in Diagn. Microbiol. Infect. Dis. 2024, 110, 116393. [Google Scholar] [CrossRef]
  43. Morgand, M.; Leclercq, A.; Maury, M.M.; Bracq-Dieye, H.; Thouvenot, P.; Vales, G.; Lecuit, M.; Charlier, C. Listeria Monocytogenes-Associated Respiratory Infections: A Study of 38 Consecutive Cases. Clin. Microbiol. Infect. 2018, 24, 1339.e1–1339.e5. [Google Scholar] [CrossRef]
  44. Chen, Y.; Feng, W.; Ye, K.; Guo, L.; Xia, H.; Guan, Y.; Chai, L.; Shi, W.; Zhai, C.; Wang, J.; et al. Application of Metagenomic Next-Generation Sequencing in the Diagnosis of Pulmonary Infectious Pathogens From Bronchoalveolar Lavage Samples. Front. Cell. Infect. Microbiol. 2021, 11, 541092. [Google Scholar] [CrossRef]
  45. Langelier, C.; Zinter, M.S.; Kalantar, K.; Yanik, G.A.; Christenson, S.; O’Donovan, B.; White, C.; Wilson, M.; Sapru, A.; Dvorak, C.C.; et al. Metagenomic Sequencing Detects Respiratory Pathogens in Hematopoietic Cellular Transplant Patients. Am. J. Respir. Crit. Care Med. 2018, 197, 524–528. [Google Scholar] [CrossRef] [PubMed]
  46. Chapman, R.; Jones, L.; D’Angelo, A.; Suliman, A.; Anwar, M.; Bagby, S. Nanopore-Based Metagenomic Sequencing in Respiratory Tract Infection: A Developing Diagnostic Platform. Lung 2023, 201, 171–179. [Google Scholar] [CrossRef]
  47. Ma, L.; Zhu, C.; Yan, T.; Hu, Y.; Zhou, J.; Li, Y.; Du, F.; Zhou, J. Illumina and Nanopore Sequencing in Culture-Negative Samples from Suspected Lower Respiratory Tract Infection Patients. Front. Cell. Infect. Microbiol. 2024, 14, 1230650. [Google Scholar] [CrossRef]
  48. Ye, J.; Huang, K.; Xu, Y.; Chen, N.; Tu, Y.; Huang, J.; Shao, L.; Kong, W.; Zhao, D.; Xie, Y. Clinical Application of Nanopore-Targeted Sequencing Technology in Bronchoalveolar Lavage Fluid from Patients with Pulmonary Infections. Microbiol. Spectr. 2024, 12, e00026-24. [Google Scholar] [CrossRef]
  49. Sun, G.; Liu, W.; Zheng, Q.; Shan, Q.; Hou, H. Ratio of Procalcitonin/Simpson’s Dominance Index Predicted the Short-Term Prognosis of Patients with Severe Bacterial Pneumonia. Front. Cell. Infect. Microbiol. 2023, 13, 1175747. [Google Scholar] [CrossRef] [PubMed]
  50. Salter, S.J.; Cox, M.J.; Turek, E.M.; Calus, S.T.; Cookson, W.O.; Moffatt, M.F.; Turner, P.; Parkhill, J.; Loman, N.J.; Walker, A.W. Reagent and Laboratory Contamination Can Critically Impact Sequence-Based Microbiome Analyses. BMC Biol. 2014, 12, 87. [Google Scholar] [CrossRef] [PubMed]
  51. Pérez-Cobas, A.E.; Ginevra, C.; Rusniok, C.; Jarraud, S.; Buchrieser, C. The Respiratory Tract Microbiome, the Pathogen Load, and Clinical Interventions Define Severity of Bacterial Pneumonia. Cell Rep. Med. 2023, 4, 101167. [Google Scholar] [CrossRef]
  52. Abdill, R.J.; Adamowicz, E.M.; Blekhman, R. Public Human Microbiome Data Are Dominated by Highly Developed Countries. PLoS Biol. 2022, 20, e3001536. [Google Scholar] [CrossRef]
  53. Gaulke, C.A.; Sharpton, T.J. The Influence of Ethnicity and Geography on Human Gut Microbiome Composition. Nat. Med. 2018, 24, 1495–1496. [Google Scholar] [CrossRef]
  54. Dabrowski, A.N.; Shrivastav, A.; Conrad, C.; Komma, K.; Weigel, M.; Dietert, K.; Gruber, A.D.; Bertrams, W.; Wilhelm, J.; Schmeck, B.; et al. Peptidoglycan Recognition Protein 4 Limits Bacterial Clearance and Inflammation in Lungs by Control of the Gut Microbiota. Front. Immunol. 2019, 10, 2106. [Google Scholar] [CrossRef]
  55. Kozich, J.J.; Westcott, S.L.; Baxter, N.T.; Highlander, S.K.; Schloss, P.D. Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform. Appl. Environ. Microbiol. 2013, 79, 5112–5120. [Google Scholar] [CrossRef]
  56. Quast, C.; Pruesse, E.; Yilmaz, P.; Gerken, J.; Schweer, T.; Yarza, P.; Peplies, J.; Glöckner, F.O. The SILVA Ribosomal RNA Gene Database Project: Improved Data Processing and Web-Based Tools. Nucleic Acids Res. 2013, 41, D590–D596. [Google Scholar] [CrossRef]
  57. Rognes, T.; Flouri, T.; Nichols, B.; Quince, C.; Mahé, F. VSEARCH: A Versatile Open Source Tool for Metagenomics. PeerJ 2016, 4, e2584. [Google Scholar] [CrossRef] [PubMed]
  58. NCBI RefSeq Targeted Loci Project. Available online: https://www.ncbi.nlm.nih.gov/refseq/targetedloci/ (accessed on 29 February 2024).
  59. Available online: https://github.com/OpenGene/fastplong (accessed on 30 April 2025).
  60. Curry, K.D.; Wang, Q.; Nute, M.G.; Tyshaieva, A.; Reeves, E.; Soriano, S.; Wu, Q.; Graeber, E.; Finzer, P.; Mendling, W.; et al. Emu: Species-Level Microbial Community Profiling of Full-Length 16S rRNA Oxford Nanopore Sequencing Data. Nat. Methods 2022, 19, 845–853. [Google Scholar] [CrossRef] [PubMed]
  61. Roller, B.R.K.; Stoddard, S.F.; Schmidt, T.M. Exploiting rRNA Operon Copy Number to Investigate Bacterial Reproductive Strategies. Nat. Microbiol. 2016, 1, 16160. [Google Scholar] [CrossRef] [PubMed]
  62. Gujju, V.R.; Akram, B.; Shibib, D.R.; McGhee, M.A.; Drevets, D.A. Bordetella Bronchiseptica Infections in Patients with HIV/AIDS: A Case Report and Review of the Literature. Medicine 2021, 100, e28244. [Google Scholar] [CrossRef]
  63. Yao, Y.; Falgenhauer, L.; Falgenhauer, J.; Hauri, A.M.; Heinmüller, P.; Domann, E.; Chakraborty, T.; Imirzalioglu, C. Carbapenem-Resistant Citrobacter Spp. as an Emerging Concern in the Hospital-Setting: Results From a Genome-Based Regional Surveillance Study. Front. Cell. Infect. Microbiol. 2021, 11, 744431. [Google Scholar] [CrossRef]
  64. Dalhoff, K.; Abele-Horn, M.; Andreas, S.; Deja, M.; Ewig, S.; Gastmeier, P.; Gatermann, S.; Gerlach, H.; Grabein, B.; Heußel, C.; et al. S3-Leitlinie zur Epidemiologie, Diagnostik und Therapie erwachsener Patienten mit nosokomialer Pneumonie–Update 2017. Available online: https://register.awmf.org/assets/guidelines/020-013l_S3_Nosokomiale_Pneumonie_Erwachsener_2017-11.pdf (accessed on 27 January 2024).
  65. Ewig, S.; Kolditz, M.; Pletz, M.; Altiner, A.; Albrich, W.; Droemann, D.; Flick, H.; Gatermann, S.; Krüger, S.; Nehls, W.; et al. S3-Leitlinie zur Behandlung von Erwachsenen Patienten mit Ambulant Erworbener Pneumonie–Update 2021. Available online: https://register.awmf.org/assets/guidelines/020-020l_S3_Behandlung-von-erwachsenen-Patienten-mit-ambulant-erworbener-Pneumonie__2021-05.pdf (accessed on 27 January 2024).
  66. Koufakis, T.; Chatzopoulou, M.; Margaritis, A.; Tsiakalou, M.; Gabranis, I. Pneumonia by Listeria Monocytogenes: A Common Infection by an Uncommon Pathogen. Case Rep. Infect. Dis. 2015, 2015, 627073. [Google Scholar] [CrossRef] [PubMed]
  67. Viswanath, L.S.; Gunalan, A.; Jamir, I.; S, B.; S, A.; K, A.; Biswas, R. Prevotella oris: A Lesser Known Etiological Agent of Pleural Effusion. Anaerobe 2022, 78, 102644. [Google Scholar] [CrossRef]
  68. Ryan, M.P.; Pembroke, J.T.; Adley, C.C. Differentiating the Growing Nosocomial Infectious Threats Ralstonia Pickettii and Ralstonia Insidiosa. Eur. J. Clin. Microbiol. Infect. Dis. 2011, 30, 1245–1247. [Google Scholar] [CrossRef]
  69. Furuta, A.; Brokaw, A.; Manuel, G.; Dacanay, M.; Marcell, L.; Seepersaud, R.; Rajagopal, L.; Adams Waldorf, K. Bacterial and Host Determinants of Group B Streptococcal Infection of the Neonate and Infant. Front. Microbiol. 2022, 13, 820365. [Google Scholar] [CrossRef]
  70. Garriss, G.; Nannapaneni, P.; Simões, A.S.; Browall, S.; Subramanian, K.; Sá-Leão, R.; Goossens, H.; de Lencastre, H.; Henriques-Normark, B. Genomic Characterization of the Emerging Pathogen Streptococcus pseudopneumoniae. mBio 2019, 10, e01286-19. [Google Scholar] [CrossRef] [PubMed]
  71. Shi, J.; Liu, R.; Qiu, J.; Wei, C.; Pan, D.; Xiang, T.; Cheng, N. Pulmonary Infection Caused by Tropheryma Whipplei: A Case Report and Review of the Literature. J. Med. Case Rep. 2024, 18, 613. [Google Scholar] [CrossRef] [PubMed]
Figure 1. A total of 144 BALF samples were collected and have undergone routine diagnostics. Of those, 122 samples were used for Illumina short-read sequencing and 102 for ONT long-read sequencing, resulting in 106 and 81 samples being included in this study, respectively.
Figure 1. A total of 144 BALF samples were collected and have undergone routine diagnostics. Of those, 122 samples were used for Illumina short-read sequencing and 102 for ONT long-read sequencing, resulting in 106 and 81 samples being included in this study, respectively.
Ijms 26 09841 g001
Figure 2. Taxonomic bar charts of Illumina short-read sequencing of the V4 region of the 16S rRNA gene after the removal of detected contaminants. The missing parts of the bars indicate the proportion of contamination removed.
Figure 2. Taxonomic bar charts of Illumina short-read sequencing of the V4 region of the 16S rRNA gene after the removal of detected contaminants. The missing parts of the bars indicate the proportion of contamination removed.
Ijms 26 09841 g002
Figure 3. Taxonomic bar charts of ONT long-read sequencing of the full-length 16S rRNA gene after the removal of detected contaminants limited to genus resolution. The missing parts of the bars indicate the proportion of contamination removed.
Figure 3. Taxonomic bar charts of ONT long-read sequencing of the full-length 16S rRNA gene after the removal of detected contaminants limited to genus resolution. The missing parts of the bars indicate the proportion of contamination removed.
Ijms 26 09841 g003
Figure 4. Overview of DNA concentration (a) and Shannon diversity index for Illumina short-read sequencing results (b) from extracted BALF samples, which are sorted by culture result and plotted with their medians. DNA concentration of the extracts was sorted by microbiome profile category for Illumina short-read sequencing (c) and ONT long-read sequencing (d). DNA concentration is plotted on a logarithmic scale (base two). Asterisks indicate statistically significant differences. * = p ≤ 0.05; ** = p ≤ 0.01; *** = p ≤ 0.001; **** = p ≤ 0.0001.
Figure 4. Overview of DNA concentration (a) and Shannon diversity index for Illumina short-read sequencing results (b) from extracted BALF samples, which are sorted by culture result and plotted with their medians. DNA concentration of the extracts was sorted by microbiome profile category for Illumina short-read sequencing (c) and ONT long-read sequencing (d). DNA concentration is plotted on a logarithmic scale (base two). Asterisks indicate statistically significant differences. * = p ≤ 0.05; ** = p ≤ 0.01; *** = p ≤ 0.001; **** = p ≤ 0.0001.
Ijms 26 09841 g004
Figure 5. Venn diagram showing the overlap of samples containing a potential pathogen between culture-based diagnostic and ONT long-read sequencing on species level. Illumina short-read sequencing has been assessed on genus level.
Figure 5. Venn diagram showing the overlap of samples containing a potential pathogen between culture-based diagnostic and ONT long-read sequencing on species level. Illumina short-read sequencing has been assessed on genus level.
Ijms 26 09841 g005
Table 1. Culture-based diagnostic results listed with the respective categories from Illumina short-read sequencing and ONT long-read sequencing. The below-3% category also includes hits with less than ten sequence reads for ONT sequencing. Potential pathogenic bacteria are marked in bold. E. coli was excluded because of the overlap with detected contamination in NGS.
Table 1. Culture-based diagnostic results listed with the respective categories from Illumina short-read sequencing and ONT long-read sequencing. The below-3% category also includes hits with less than ten sequence reads for ONT sequencing. Potential pathogenic bacteria are marked in bold. E. coli was excluded because of the overlap with detected contamination in NGS.
Culture-Based DiagnosticsCorresponding Genera Detected by
Illumina Short-Read Sequencing
Corresponding Species Detected by
ONT Long-Read Sequencing
SpeciesNo. of
Reports
>20%≥10%
<20%
≥3%
<10%
<3%Absent>20%≥10%
<20%
≥3%
<10%
<3%Absent
A. odontolyticus1---1------
B. bronchiseptica1--1----1--
C. koseri2----22----
E. faecalis21---11---1
E. faecium42-11--1--2
E. coli2----------
H. influenzae11---------
H. parahaemolyticus11----1----
L. reuteri1-1--------
L. salivarius1----1----1
Saproph. Neisseria12--633-1125
N. flava211-------2
P. melaninogenica5-131----31
P. aeruginosa53--111---3
Rothia sp.31-2------2
R. mucilaginosa5131-----22
S. marcescens1----1----1
Coagulase-neg. Staphylococcus92-1242-132
S. aureus1131-431-115
S. epidermidis4-112-1--11
S. maltophilia33----2---1
α-haemolytic
Streptococcus
35101195-163322
S. mitis3-111-----3
S. parasanguinis1--1------1
S. pneumoniae11----1----
S. salivarius1--1----1--
V. parvula1-1------1-
Sum118302128211628571635
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bitter, M.; Weigel, M.; Mengel, J.P.; Ott, B.; Windhorst, A.C.; Tello, K.; Imirzalioglu, C.; Hain, T. Assessment of Microbiome-Based Pathogen Detection Using Illumina Short-Read and Nanopore Long-Read Sequencing in 144 Patients Undergoing Bronchoalveolar Lavage in a University Hospital in Germany. Int. J. Mol. Sci. 2025, 26, 9841. https://doi.org/10.3390/ijms26209841

AMA Style

Bitter M, Weigel M, Mengel JP, Ott B, Windhorst AC, Tello K, Imirzalioglu C, Hain T. Assessment of Microbiome-Based Pathogen Detection Using Illumina Short-Read and Nanopore Long-Read Sequencing in 144 Patients Undergoing Bronchoalveolar Lavage in a University Hospital in Germany. International Journal of Molecular Sciences. 2025; 26(20):9841. https://doi.org/10.3390/ijms26209841

Chicago/Turabian Style

Bitter, Merle, Markus Weigel, Jan Philipp Mengel, Benjamin Ott, Anita C. Windhorst, Khodr Tello, Can Imirzalioglu, and Torsten Hain. 2025. "Assessment of Microbiome-Based Pathogen Detection Using Illumina Short-Read and Nanopore Long-Read Sequencing in 144 Patients Undergoing Bronchoalveolar Lavage in a University Hospital in Germany" International Journal of Molecular Sciences 26, no. 20: 9841. https://doi.org/10.3390/ijms26209841

APA Style

Bitter, M., Weigel, M., Mengel, J. P., Ott, B., Windhorst, A. C., Tello, K., Imirzalioglu, C., & Hain, T. (2025). Assessment of Microbiome-Based Pathogen Detection Using Illumina Short-Read and Nanopore Long-Read Sequencing in 144 Patients Undergoing Bronchoalveolar Lavage in a University Hospital in Germany. International Journal of Molecular Sciences, 26(20), 9841. https://doi.org/10.3390/ijms26209841

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop