Next Article in Journal
Integrated Assessment of Benthic Bacterial Community Physiology, Structure, and Function Across C, N, P, and S Gradients in Lake Villarrica Sediments, Chile
Previous Article in Journal
Competition-Exclusion for Manganese Is Involved in Antifungal Activity of Two Lactic Acid Bacteria Against Various Dairy Spoilage Fungi
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of QIIME1 and QIIME2 for Analyzing Fungal Samples from Various Built Environments

1
Graduate School of Engineering, Kogakuin University, Tokyo 163-8677, Japan
2
School of Architecture, Kogakuin University, Tokyo 163-8677, Japan
*
Author to whom correspondence should be addressed.
Microorganisms 2025, 13(11), 2545; https://doi.org/10.3390/microorganisms13112545
Submission received: 5 October 2025 / Revised: 1 November 2025 / Accepted: 4 November 2025 / Published: 6 November 2025
(This article belongs to the Section Microbiomes)

Abstract

This study evaluates the differences between bioinformatics pipelines by analyzing samples collected from various built environments. Previous comparative studies of microbial community analysis pipelines have largely focused on bacterial communities, mock communities, or soil fungi, often with small sample sizes, and have not specifically targeted built environments. Our results highlight key differences between OTU (QIIME1) and ASV (QIIME2) analyses. OTU analysis clusters OTUs at 97% similarity and tends to show higher diversity values in diversity analyses. Regarding abundantly detected fungi, OTU analysis identified more genera than ASV analysis. However, the OTU method has a high rate of false positives and false negatives, indicating low error-removal capability and suggesting that many fungal genera may have been detected. Therefore, a combined approach using OTU analysis combined with ASV analysis allows for both the comprehensive detection of dominant taxa and the inclusion of rare species. Overall, our findings emphasize that the choice of pipeline significantly influences the composition of the observed fungal community in built environments. Careful consideration of both OTU and ASV strategies can enhance the reliability and completeness of fungal metabarcoding studies, particularly when studying complex indoor microbial communities.

1. Introduction

Fungi comprise an extremely diverse group of organisms, including yeasts, molds, and mushrooms. It is estimated that millions of species exist on Earth, many of which remain undefined [1]. While fungi play an essential role in nature through the decomposition of organic matter and contribute to nutrient cycling, some species also exhibit pathogenicity toward humans, animals, and plants [2,3]. Particularly in human living environments, fungi can potentially cause diverse health impacts, such as acting as allergens [4], producing mycotoxins [5], and causing infections under conditions of immune suppression [6]. Clinically significant genera and species include Aspergillus (A. fumigatus, A. flavus, A. niger) [4,5,6], Penicillium (P. chrysogenum, P. marneffei) [5,6], Cladosporium [5], Alternaria [5], Stachybotrys (S. chartarum) [5], and Fusarium [5,6]. Among these species, A. fumigatus is the primary cause of fungal sensitization in asthma patients and the main causative agent of allergic bronchopulmonary aspergillosis (ABPA) [4]. Major fungal allergens include Asp f 1 Asp f 1, Asp f 2, Asp f 3, Asp f 4 Asp f 5, Asp f 6 derived from A. fumigatus, which can trigger asthma exacerbations and ABPA in sensitized individuals [4]. Up to 50% of severe asthma patients suffer from fungal sensitization, with approximately 6.5 million people worldwide estimated to have severe asthma with fungal sensitization (SAFS) [4]. Additionally, mycotoxins produced by indoor fungi include aflatoxins, ochratoxin A, trichothecenes, patulin, citrinin, and fumonisins, which cause health hazards such as carcinogenicity, immunosuppression, and renal/hepatic toxicity [5]. Aflatoxin B1 is known to be a potent carcinogen, whereas ochratoxin A is recognized for its nephrotoxicity [5]. These effects pose a particular threat to immunocompromised individuals and patients with chronic respiratory diseases, underscoring the importance of fungal control and proper management in indoor environments [4,6]. Accurately understanding the composition and diversity of fungal communities is therefore crucial from the perspectives of public health and environmental sanitation [7].
Fungal identification and analysis methods have traditionally relied on culture-based techniques and morphological observation. However, these approaches are often limited, particularly for species that are difficult to culture or for distinguishing morphologically similar taxa, making it challenging to fully assess fungal diversity. In recent years, advances in molecular biology techniques have established ribosomal DNA sequence analysis as the primary method for fungal identification [8]. Among these analyses, the Internal Transcribed Spacer (ITS) region has been internationally adopted as the standard fungal DNA barcode, with the ITS1 and ITS2 regions serving as the principal targets for metabarcoding analysis [9,10]. With the widespread application of Next-Generation Sequencing (NGS), comprehensive analyses of fungal communities based on these ITS regions have rapidly expanded, revealing the presence of diverse fungi previously undetectable by means of culture methods [11].
Research on fungal communities in built environments such as homes, hospitals, and schools has gained particular attention in recent years [12,13]. Indoor fungal exposure has been reported to be associated with the onset and exacerbation of allergic rhinitis and asthma, in addition to respiratory diseases such as hypersensitivity pneumonia. Thus, analyzing indoor fungal communities is of considerable importance in both environmental epidemiology and clinical medicine [14,15,16]. However, their composition is influenced by a variety of factors, including geographical conditions [17], seasonal variations [18,19], building materials, and ventilation systems [20,21]. In addition, outcomes may vary based on differences in analytical methods and bioinformatics approaches [22,23]. As a result, direct comparison and integration of data across studies remain challenging [24]. Furthermore, with regard to quantitative standards for fungal exposure, the WHO guidelines do not specify quantitative exposure limits such as spore counts or CFU/m3. They clearly state that there is insufficient scientific evidence to establish numerical standards for biological contaminants [25]. Similarly, the EPA guidelines do not define quantitative concentration limits for fungal exposure. Instead, they provide recommendations focusing on moisture and mold control, humidity management, and improved ventilation [26].
To conduct fungal community analysis using NGS data, a bioinformatics pipeline is essential to perform a series of computational steps, including sequence quality control, taxonomic assignment, and diversity metric calculation [27,28,29]. Among the available platforms, QIIME (Quantitative Insights Into Microbial Ecology) is one of the most widely used internationally [30,31,32]. QIIME1, released in 2010, employs an OTU (Operational Taxonomic Unit) clustering approach based on 97% sequence similarity [33,34] and has been applied to the analysis of diverse microbial communities, including bacteria and fungi [33,34,35]. In contrast, QIIME2, which has gained widespread adoption since 2018 [36], enables ASV (Amplicon Sequence Variant)-based processing through tools such as DADA2 and Deblur [37,38,39]. It incorporates multiple improvements, including enhanced sequence error correction accuracy, improved reproducibility, and extensibility, through its plugin-based architecture [28,40].
Mock communities (artificial microbial communities) composed of known species are frequently employed in performance evaluation and comparative studies of analysis pipelines [41,42,43]. They are particularly useful for evaluating taxonomic identification accuracy and sequence processing algorithms [44] and have been widely applied in bacterial 16S analysis [45]. However, the use of mock communities in fungal ITS analysis presents several challenges. First, the ITS region exhibits substantial variation in length and base composition both between and within species [46,47]. This variation results in differences in PCR amplification efficiency and sequence coverage among species, often causing the observed sequence ratios to deviate from the expected community composition [44]. Second, fungal ITS reference databases are less comprehensive than bacterial 16S databases [30,48]. For example, while the UNITE database currently contains approximately 3.8 × 106 ITS sequences and roughly 2.4 × 105 Species Hypotheses, the bacterial 16S reference database (SILVA) contains millions of SSU sequences covering a broader range of taxa. Therefore, even known species included in mock communities may not always be accurately assigned at all taxonomic levels [49,50]. Moreover, the availability of standardized fungal mock communities is limited, making it difficult for researchers to conduct evaluations under identical conditions [51,52]. Collectively, these factors complicate the interpretation of results in pipeline comparisons and may introduce biases different from those observed in real environmental samples [38,45].
Comparative studies between QIIME1 and QIIME2 have been widely reported for bacterial 16S rRNA gene analysis [32,53,54], demonstrating that ASV-based QIIME2 provides higher reproducibility and resolution [55,56]. In contrast, direct comparative studies of QIIME1 and QIIME2 targeting fungal ITS regions are extremely limited [38,57,58], and the extent to which differences in analysis pipelines influence results, particularly in the fungal community analysis of residential environments, remains insufficiently validated [58,59]. Given the substantial diversity of the fungal-specific ITS region [46,60], the incompleteness of fungal reference databases [30], and the challenges associated with using mock communities [38,59], the impact of analysis method choice on results may be even greater for fungi than for bacteria [38,57,59].
Fungal communities are known to vary depending on building type and differences in air conditioning and ventilation systems, with fungal community composition differing between indoor and outdoor environments. Furthermore, bioinformatics pipelines have been suggested to potentially influence the results of fungal community analysis. Based on the above findings, we propose the following three hypotheses:
(1)
Indoor fungal communities differ depending on building type and variations in air conditioning and ventilation systems.
(2)
The composition of fungal communities differs between indoor and outdoor environments.
(3)
Differences are observed in the analysis results obtained using QIIME1 and QIIME2.
In this study, we aimed to compare fungal community analyses conducted with QIIME1 and QIIME2 using identical ITS sequence datasets obtained from various built environments, evaluating the outcomes in terms of diversity metrics and taxonomic composition. To achieve this goal, fungal communities were analyzed in various built environments across humid temperate and subtropical regions of Japan, including housing, office buildings, and movie theaters. Samples were collected from the air, surfaces, and settled dust in order to capture the diversity of indoor microbiomes.

2. Materials and Methods

2.1. Data Acquisition Source

The dataset was measured across groups (outdoors in summer, outdoors in winter, an office in summer, an office in winter, a movie theater, an air conditioner filter, and an air conditioner coil), totaling 84 samples (Table 1).
In the summer of 2019 and winter of 2020, a total of 32 samples were collected, comprising 8 samples from each group. Sampling was performed by attaching filters (pore size 0.3 μm) to air pumps in five office buildings in Tokyo and three in Aichi Prefecture, in addition to at their outdoor locations. The filters exhibit high capture efficiency of approximately 99.7 to 99.99% for 0.3 μm particles and demonstrate high performance for particles ranging from 0.1 μm to several micrometers. The primary particle capture mechanisms include diffusion, inertial impaction, and interception [61]. Each sample was collected at a flow rate of 3 L/min for 60 min, corresponding to a total volume of 180 L.
In the summer of 2023, a total of 18 samples were collected from the front and rear floor surfaces after three screenings at each of two movie theaters in Tokyo and one in Chiba Prefecture. Sterile cotton swabs were used to sample 10 cm × 10 cm areas located directly beneath seats, avoiding aisles. Each swab was stored individually in a sterile container (ST-25, ELMEX, Tokyo, Japan) containing 10 mL of phosphate-buffered saline (PBS) in a freezer (−20 °C) and subsequently processed in the laboratory.
In the summer of 2021, a total of 34 fungal samples were collected from 17 residential air conditioners in Kanagawa Prefecture—17 samples each from the filters and coil surfaces. Sterile swabs were used to wipe fungi from a 25 cm2 surface area of each component. The swabs were then placed individually in containers (ST-25, ELMEX) with 10 mL of sterile PBS in a freezer (−20 °C) and processed in the laboratory. For each component, samples were collected from two sites, corresponding to a total sampled area of 50 cm2.

2.2. DNA Extraction—ITS Sequencing

After the fungal samples adhering to surfaces were collected as described above, the swabs were processed using a Stomacher (Stomacher® 80 Biomaster, seward, West Sussex, UK). A mixture of 3 mL DNase-free water and 2 mL sample solution was prepared, and DNA extraction was performed with the Stomacher. The processed sample was then removed from the Stomacher bag, transferred to a 1.5 mL microcentrifuge tube, and centrifuged at 4 °C and 3000 rpm for 30 min using a KUBOTA5911 centrifuge (KUBOTA, Tokyo, Japan) to pellet the fungal material. DNA was subsequently purified using the NucleoSpin® Tissue Kit (740952, MACHEREY-NAGEL, Düren, Germany). During the purification process, the solution was vortexed, heated, mixed with ethanol, centrifuged, and subjected to additional standard processing steps according to the manufacturer’s protocol.
For each sample, the fungal ITS2rRNA gene was amplified using primers “5′-ACACTCTTTCCCTACACGACGCTCTTCCGATCT-GTGAATCATCGARTCTTTG-3′ (gITS7)” and “5′-GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT-TCCTCCGCTTATTGATATGC-3′ (ITS4)”. DNA amplification and ITS2 rRNA gene sequencing were performed using the Illumina NGS platform, with DNA processing outsourced to a commercial laboratory.
DNA quality was verified using the Agilent 2200 TapeStation (Agilent, Santa Clara, CA, USA), and all samples containing nucleic acid concentrations of sufficient quality, and only samples with nucleic acid concentrations meeting quality and quantity thresholds, were processed. The sequencing libraries were pooled, and PCR products were purified using AMPure XP beads (Beckman Coulter, San Jose, CA, USA) (1:1 bead-to-sample volume ratio) to enhance library quality.

2.3. OTU Sequence Assignment

Fungal OTUs were generated using QIIME1 (1.9.1). Reads with a quality score of ≤20 were removed, while no trimming of read length was applied (Table 2). Chimeric sequences were removed, and taxonomic assignment was performed using UNITE version 10 (19 February 2025). In QIIME1, operational taxonomic units (OTUs) were clustered using the 97% similarity threshold, which has been widely adopted in microbial ecology research. This threshold was originally derived from studies of the bacterial 16S rRNA gene and is based on the empirical criterion that sequences sharing 97% or greater similarity can generally be considered members of the same species [62]. Although the fungal ITS region exhibits greater variability than 16S rRNA, the same 97% threshold has been used for convenience to maintain consistency with past studies and comparability between datasets. We also adopted the 97% threshold.

2.4. ASV Sequence Determination

Fungal ASVs were generated using QIIME2 (2025.4). To perform quality filtering, the DADA2 parameters --p-max-ee-f and --p-max-ee-r were applied with the default value of 2.0 (Table 2). No unique trimming was performed; instead, read lengths that maximized the number of retained reads in DADA2 were used. Specifically, the parameters --p-trunc-len-f and --p-trunc-len-r were set to 160 (forward) and 150 (reverse), respectively. Taxonomic assignment was conducted using UNITE version 10 (19 February 2025).

2.5. Statistical Analysis Methods

In this study, the Wilcoxon signed-rank test was performed using IBM SPSS Statistics version 29 to compare taxonomic hierarchy and α diversity values of fungal communities between QIIME1 and QIIME2. Mantel test analyses were performed using R Statistical Software (v4.2.0), via the vegan R package (v2.6-4) to compare taxonomic hierarchy, β diversity values of fungal communities.
The Wilcoxon signed-rank test is a paired nonparametric test, and paired values were obtained from QIIME1 and QIIME2 analyses of the same samples. A p-value of <0.05 was considered statistically significant. For comparisons of taxonomic hierarchies, relative abundance was used. α diversity was assessed with three metrics: The observed species index, Shannon index, and PD whole tree index. To ensure robustness, a fixed sequencing depth (sample depth) was randomly resampled 10 times from each sample, and diversity indices were calculated for each iteration. The median value from the 10 obtained values for each metric was adopted as the representative metric for subsequent statistical testing. This approach provides more stable representative values while accounting for variation due to resampling. β diversity was evaluated using both weighted and unweighted UniFrac distance, Bray–Curtis distances, and Jaccard distances calculated by each pipeline.

3. Results

3.1. Microbial Community α and β Diversity

To evaluate differences between pipelines, we compared α and β diversity generated by QIIME1 and QIIME2. Statistical significance of α diversity was evaluated using the Wilcoxon signed-rank test. *: p < 0.05 and **: p < 0.001. Statistical significance of β diversity was evaluated using the Mantel test. *: p < 0.05, **: p < 0.01, and ***: p < 0.001.
For the observed species index, QIIME1 yielded significantly higher values than QIIME2 across all sample types (Figure 1). Variability was also lower in QIIME2, particularly in winter outdoor samples. For the PD whole tree index, QIIME1 likewise produced significantly higher values across all sample types (Figure 2). In contrast, the Shannon index showed no significant difference between the two pipelines (Figure 3). When comparing sampling methods and applications, the observed spices index and PD whole tree index indicated that indoor office air samples exhibited less variability than outdoor air samples in both pipelines, suggesting that indoor air quality is relatively consistent across building types. The Shannon index revealed no clear discernible difference in variability between indoor and outdoor air; however, seasonal patterns were evident, with lower variability observed in winter. Similarly, differences emerged between air conditioner filters and coils: for the observed species and PD whole tree indices, coils exhibited lower variability than filters.
To determine β diversity, pipelines were compared using Weighted and Unweighted UniFrac distances. Weighted UniFrac distances were significantly higher in QIIME1 than in QIIME2 across all sample types (Figure 4). Unweighted UniFrac distances also showed significant differences between the two pipelines, except for winter outdoor samples (Figure 5). Regarding Bray–Curtis distance, significant differences were observed between the two pipelines for all sample types except the winter outdoor sample (Figure 6). Regarding Jaccard distance, significant differences were observed between the two pipelines for all sample types except the winter outdoor and office samples (Figure 7).

3.2. Taxonomic Differences Related to the Pipeline

We compared differences in the number of OTUs and ASVs obtained after base sequence determination (Table 3). Sequencing depth was confirmed using a rarefaction curve. Sequence analysis yielded 130,977 ± 18,076 reads from the summer outdoor sample, 76,097 ± 8832 reads from the winter outdoor sample, 125,850 ± 20,762 reads from the summer office sample, 76,714 ± 10,179 reads from the winter office sample, 96,637 ± 18,260 reads from the movie theater sample, 28,824 ± 7229 reads from the air conditioner filter sample, and 27,349 ± 5948 reads from the air conditioner coil sample. After processing with QIIME1 and QIIME2 (DADA2), QIIME1 consistently produced higher values; in comparison, QIIME2 yielded lower counts and diversity for ASVs. In terms of base-call reduction rates during sequence analysis, QIIME1 showed reductions of up to 75% in some summer outdoor samples, whereas QIIME2 showed reductions of up to 42% in some winter office samples, reflecting differences in filtering stringency between the pipelines. Overall, significant differences were observed in OTU and ASV counts, and when focusing on specific groups, the average difference was at least twofold.
The Venn diagram presented in Figure 8 illustrates the number of genera detected commonly across pipelines and those detected exclusively by each. The proportion of shared genera across sample types ranged from 40.5% to 79.4%, indicating relatively few overlapping genera. Genera detected exclusively using QIIME1 accounted for 20.3–58.8%; in comparison, those detected only by QIIME2 accounted for 0.3–2.7%, demonstrating that QIIME1 detected a greater number of genera.
Furthermore, we analyzed genera restricted to those with a relative abundance of ≥1% (Figure 9). The proportion of genera detected across all sample types ranged from 86.6% to 96.4%, indicating that relatively few genera were shared among all types. Genera detected exclusively using QIIME1 accounted for 3.6–13.4%; in comparison, those detected only by QIIME2 accounted for 0.0–2.0%, suggesting that QIIME1 detected more genera. In contrast, when genera with relative abundances below 1% were considered, significant differences between the two pipelines became apparent.
We also examined differences related to sampling methods and applications. Seasonal variation was observed in the number and proportion of fungal genera shared between indoor and outdoor samples, with more genera commonly detected in winter. Differences were likewise noted between air-conditioner filters and coils, with coils showing higher numbers of shared genera. When restricting the analysis to genera with a relative abundance of ≥1%, coils still exhibited higher numbers than filters, although the proportions were comparable.

3.3. Relative Composition Ratios

Differences between pipelines were examined to determine the top 20 genera with the highest relative abundances (Figure 10).
Several genera, including Parachaetomium, Mortierellaceae_gen_Incertae_sedis, Saccharomycetales_gen_Incertae_sedis, Teratosphaeriaceae_gen_Incertae_sedis, Strobilurus, Zoophthora, and Orbiliaceae_gen_Incertae_sedis, were not detected using QIIME2. Differences were also noted for genera that were abundantly detected but remain unresolved at the genus level.
Among all detected genera, Aspergillus was the most prevalent, ranking third or higher across all sample types. Although some groups exhibited significant differences between the two pipelines, the overall results were relatively consistent. Notably, the genera Aspergillus, Malassezia, Trametes, and Schizophylum include fungal species known to cause disease.

4. Discussion

Metabarcoding has become an indispensable tool for microbial community analysis, but the resulting data is vast, making bioinformatics analysis challenging. Selecting an appropriate pipeline is crucial for establishing analysis protocols.
In this study, we compared QIIME1 (97% threshold) for OTU analysis and QIIME2 (DADA2) for ASV analysis using a total of 84 samples from built environment microbiomes: summer and winter outdoor air, summer and winter indoor office air, cinema floor surface fungi, and residential air conditioner filter and coil fungi. QIIME1, frequently used until around 2020, has received reduced attention since the release of QIIME2 in 2018. However, the authors of studies comparing bacterial community analysis pipelines have reported that when the goal is “screening for rare or low-frequency candidate taxa,” QIIME1 may perform better than other methods in approaches that tolerate an increased potential for false positives while combining subsequent rigorous validation. The above demonstrates that new analysis methods are not always optimal and highlights the importance of selecting the appropriate tool based on research objectives [32,53,63,64,65]. However, the results of studies involving the use of mock communities have shown that QIIME1 (uclust) tends to generate spurious OTUs and assign a large portion of total counts to them [53]. Furthermore, QIIME1 tends to have a relatively high false positive rate and a low false negative rate; in comparison, QIIME2 (DADA2) exhibits a lower false positive rate and a slightly higher false negative rate. In studies targeting bacteria, the characteristics of each pipeline have been broadly confirmed [66].
In this study, we graphically represented and analyzed differences in α diversity, β diversity, read counts, number of detected genera, and relative abundance. As a result, a difference was confirmed between QIMME1 and QIMME2, consistent with previous studies.
The results showed significant differences between the pipelines in most analyses except for the Shannon index. The difference in the number of filtered reads was significant, and this result was reflected in the observed species. This finding is thought to be strongly influenced by clustering and filtering in ASV analysis, consistent with previous studies. QIIME1 outputs a very large number of OTUs, most of which are estimated to be spurious [53], a factor that is believed to have similarly manifested in this study. This finding was thought to be strongly influenced by the effects of clustering and filtering in ASV analysis. Furthermore, the PD whole tree value was lower for QIIME2, likely due to the reduction in species number. No significant differences were observed in the Shannon index across all samples. The lack of significance in the Shannon index suggests that the abundance and evenness contributing at high frequencies are nearly identical between the two pipelines. However, differences in the abundance of low-frequency species and the presence of numerous false positives are indicated. Previous studies have noted that differences exist between pipelines, suggesting it is more appropriate to focus on qualitative differences between samples rather than quantitative values [67,68]. These differences likely exist because ASV analysis removes many rare bacterial species, and since the Shannon index measures community evenness, removing species with inherently low relative abundances does not alter the index value. Furthermore, when focusing on differences in sampling methods and applications, we found that variations in the observed species and PD whole tree indices differed between office and outdoor airborne samples. Outdoor air is subject to numerous external factors such as temperature, humidity, wind direction, and rainfall, with conditions changing hourly. In contrast, Japanese office spaces are regulated by the Act on Maintenance of Sanitation in Buildings to maintain specified temperature and humidity ranges, likely resulting in less variation in fungal communities. However, while no difference was observed between offices and outdoors for the Shannon index, a difference was seen between summer and winter. This difference is thought to be because winter is characterized by low temperature and humidity, creating an environment unsuitable for fungal growth. Consequently, only a limited number of fungi grow during this time period, resulting in a significantly different microbial community balance. Furthermore, focusing on the coils and filters of residential air conditioners, we found that filters showed higher values for all three indices. Compared to coils, filters harbored a greater number of observed species, exhibited greater phylogenetic diversity, and demonstrated higher evenness. During cooling operation, the humidity inside air conditioners is known to be higher than that of the indoor air [69], creating an environment conducive to fungal growth. In contrast, filters are located relatively close to the indoor environment and are easily influenced by outdoor air entering through open windows or ventilation systems. However, the coils are situated within the confined space of the unit, which makes it difficult for spores and various fungi originating from outdoor air to enter. Therefore, the α-diversity observed on the coils was considered to be lower than that observed on the filters. Previous bacterial studies have shown that the Shannon index is higher on filters than on coils. Since bacteria, like fungi, tend to thrive in warm and humid environments, this finding suggests that a similar difference may also exist for fungi [69].
In this study, we utilized the UniFrac distance, analyzed based on species phylogenetic trees. For Weighted UniFrac, QIIME2 diversity indices were found to be significantly lower across all sample groups. As mentioned above, ASV analysis that excludes low-frequency species is a denoising method. Since it performs misalignment estimation and correction, it tends to suppress the false positive rate. However, it has been noted that sensitivity (the possibility of missing true sequences) may be slightly reduced, potentially leading to an underestimation of the diversity of rare bacterial species [70].
Comparison of read counts revealed no significant difference in the number of libraries but a significant difference between OTUs and ASVs, suggesting that clustering and filtering effects are substantial in ASV analysis (Figure 11). The 95% confidence intervals and 95% prediction intervals are shown in Figure 11. Statistical significance was evaluated using the Wilcoxon signed-rank test. This difference was more than sixfold, indicating a large disparity between OTUs and ASVs analyses, highlighting the importance of pipeline selection (Figure 11b). Furthermore, the range of OTU values relative to raw library sizes showed a smaller difference between minimum and maximum values compared to ASVs, indicating differences in sensitivity to clustering and filtering between samples.
In the Venn diagram analysis, fungi detected identified to the genus level and those additionally detected with a relative abundance of 1% or higher were analyzed based on the number of detections. In the analysis covering all fungal genera, although there were differences in absolute sample numbers between sample groups, the proportion of fungi uniquely detected using QIIME2 was lower compared to QIIME1. In the analysis restricted to relative abundance ≥1%, the proportion of fungal species detected only by QIIME1 was smaller compared to the previous analysis. The results of this analysis also revealed that pipeline differences were significant for fungi with low relative abundance.
To further deepen the analysis, we presented heatmaps showing the top 20 most abundant fungal genera detected in each sample. Furthermore, some fungal genera were detected only with QIIME1, whereas several genera, including unassigned taxa, were not detected with QIIME2. This finding suggests that differences in reference databases and classification methods influence the analysis results. Therefore, strong pipeline dependency exists even at the genus level, necessitating caution in interpreting results, particularly in fungal community analysis [65]. Among the detected genera, Aspergillus was prevalent overall. In summer, regarding airborne fungal samples collected in offices and outdoors, the results suggest that outdoor influences also affect indoor environments.

5. Conclusions

In this study, we analyzed differences between pipelines using samples collected from various built environments. Previous comparative studies of microbial community analysis pipelines have focused on bacterial communities, mock communities, or soil fungi, but not in built environments, and the number of samples compared is also small in such studies. In this regard, this study is novel, and the findings are summarized below.
(1)
There are differences between OTU analysis and ASV analysis. Since OTU analysis involves OTU clustering at 97% similarity, sequence errors and clustering-induced partitioning can increase the apparent number of taxa, particularly affecting rare fungi. Therefore, when performing diversity analyses, OTU analysis tends to yield higher diversity values.
(2)
Regarding abundantly detected fungi, OTU analysis detects a greater number thereof; however, caution is warranted, as this study also suggested the possibility of false positives.
(3)
Aspergillus was predominantly detected across all groups, irrespective of pipeline or building application.
(4)
The α diversity index was higher in outdoor air than in office interiors, and the air conditioning filters exhibited greater α diversity than coils. These findings indicate an environment conducive to the growth of numerous fungi.

Author Contributions

Conceptualization: K.W. and U.Y.; methodology: K.W. and U.Y.; investigation: K.W. and U.Y.; writing—original draft preparation: K.W.; writing—review and editing: K.W. and U.Y.; visualization: K.W.; supervision: U.Y.; project administration: U.Y.; funding acquisition: U.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by Research Grants on Allergic Disease and Immunology (21ek0410055, 22ek0410097) from the Japan Agency for Medical Research and Development, Comprehensive Research Project on Health, Safety and Crisis Management Measures (21LA1005) and (21LA1501) from the Ministry of Health, Labour and Welfare, Japan.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Hawksworth, D.L.; Lücking, R. Fungal Diversity Revisited: 2.2 to 3.8 Million Species. Microbiol. Spectr. 2017, 5, 17. [Google Scholar] [CrossRef]
  2. Blackwell, M. The Fungi: 1, 2, 3... 5.1 Million Species? Am. J. Bot. 2011, 98, 426–438. [Google Scholar] [CrossRef]
  3. Fisher, M.C.; Henk, D.A.; Briggs, C.J.; Brownstein, J.S.; Madoff, L.C.; McCraw, S.L.; Gurr, S.J. Emerging Fungal Threats to Animal, Plant and Ecosystem Health. Nature 2012, 484, 186–194. [Google Scholar] [CrossRef]
  4. Denning, D.W.; Pashley, C.; Hartl, D.; Wardlaw, A.; Godet, C.; Del Giacco, S.; Delhaes, L.; Sergejeva, S. Fungal Allergy in Asthma–State of the Art and Research Needs. Clin. Transl. Allergy 2014, 4, 14. [Google Scholar] [CrossRef] [PubMed]
  5. Bennett, J.W.; Klich, M. Mycotoxins. Clin. Microbiol. Rev. 2003, 16, 497–516. [Google Scholar] [CrossRef] [PubMed]
  6. Brown, G.D.; Denning, D.W.; Gow, N.A.R.; Levitz, S.M.; Netea, M.G.; White, T.C. Hidden Killers: Human Fungal Infections. Sci. Transl. Med. 2012, 4, 165rv13. [Google Scholar] [CrossRef]
  7. Šunić, I.; Havaš Auguštin, D.; Šarac, J.; Michl, K.; Cernava, T.; Jakobsen, R.R.; Mešić, A.; Novokmet, N.; Lovrić, M. Associations Between Indoor Fungal Community Structures and Environmental Factors: Insights from the Evidence-Driven Indoor Air-Quality Improvement Study. J. Fungi 2025, 11, 261. [Google Scholar] [CrossRef]
  8. Schoch, C.L.; Seifert, K.A.; Huhndorf, S.; Robert, V.; Spouge, J.L.; Levesque, C.A.; Chen, W.; Fungal Barcoding Consortium; Fungal Barcoding Consortium Author List; Bolchacova, E.; et al. Nuclear Ribosomal Internal Transcribed Spacer (ITS) Region as a Universal DNA Barcode Marker for Fungi. Proc. Natl. Acad. Sci. USA 2012, 109, 6241–6246. [Google Scholar] [CrossRef] [PubMed]
  9. Op De Beeck, M.; Lievens, B.; Busschaert, P.; Declerck, S.; Vangronsveld, J.; Colpaert, J.V. Comparison and Validation of Some ITS Primer Pairs Useful for Fungal Metabarcoding Studies. PLoS ONE 2014, 9, e97629. [Google Scholar] [CrossRef]
  10. Procopio, N.; Ghignone, S.; Voyron, S.; Chiapello, M.; Williams, A.; Chamberlain, A.; Mello, A.; Buckley, M. Soil Fungal Communities Investigated by Metabarcoding Within Simulated Forensic Burial Contexts. Front. Microbiol. 2020, 11, 1686. [Google Scholar] [CrossRef]
  11. Jiang, S.; Chen, Y.; Han, S.; Lv, L.; Li, L. Next-Generation Sequencing Applications for the Study of Fungal Pathogens. Microorganisms 2022, 10, 1882. [Google Scholar] [CrossRef]
  12. Chawla, H.; Anand, P.; Garg, K.; Bhagat, N.; Varmani, S.G.; Bansal, T.; McBain, A.J.; Marwah, R.G. A Comprehensive Review of Microbial Contamination in the Indoor Environment: Sources, Sampling, Health Risks, and Mitigation Strategies. Front. Public Health 2023, 11, 1285393. [Google Scholar] [CrossRef]
  13. Méheust, D.; Le Cann, P.; Reboux, G.; Millon, L.; Gangneux, J.-P. Indoor Fungal Contamination: Health Risks and Measurement Methods in Hospitals, Homes and Workplaces. Crit. Rev. Microbiol. 2014, 40, 248–260. [Google Scholar] [CrossRef] [PubMed]
  14. Indoor Fungal Diversity and Asthma: A Meta-Analysis and Systematic Review of Risk Factors—Journal of Allergy and Clinical Immunology. Available online: https://www.jacionline.org/article/S0091-6749(14)00952-X/fulltext (accessed on 15 August 2025).
  15. Portnoy, J.M.; Kwak, K.; Dowling, P.; VanOsdol, T.; Barnes, C. Health Effects of Indoor Fungi. Ann. Allergy Asthma Immunol. 2005, 94, 313–319; quiz 319–322, 390. [Google Scholar] [CrossRef] [PubMed]
  16. Osborne, N.J.; Thornton, C.R.; Sharpe, R.A. Indoor Fungal Exposure and Allergic Respiratory Disease. Curr. Allergy Asthma Rep. 2015, 15, 71. [Google Scholar] [CrossRef] [PubMed]
  17. Amend, A.S.; Seifert, K.A.; Samson, R.; Bruns, T.D. Indoor Fungal Composition Is Geographically Patterned and More Diverse in Temperate Zones than in the Tropics. Proc. Natl. Acad. Sci. USA 2025, 107, 13748–13753. [Google Scholar] [CrossRef]
  18. Niculita-Hirzel, H.; Wild, P.; Hirzel, A.H. Season, Vegetation Proximity and Building Age Shape the Indoor Fungal Communities’ Composition at City-Scale. J. Fungi 2022, 8, 1045. [Google Scholar] [CrossRef]
  19. Wang, S.; Zheng, X.; Ye, J.; Sun, Z.; Chen, Z.; Cao, G.; Zhang, Y.; Shen, F.; Gao, C.X.; Qian, H. Impact of Climate Zones and Seasons on Indoor Airborne Microbial Communities: Insights from a Comprehensive Analysis. Sci. Total Environ. 2024, 926, 171879. [Google Scholar] [CrossRef]
  20. Sharpe, R.; Thornton, C.R.; Osborne, N.J. Modifiable Factors Governing Indoor Fungal Diversity and Risk of Asthma. Clin. Exp. Allergy 2014, 44, 631–641. [Google Scholar] [CrossRef]
  21. Weikl, F.; Tischer, C.; Probst, A.J.; Heinrich, J.; Markevych, I.; Jochner, S.; Pritsch, K. Fungal and Bacterial Communities in Indoor Dust Follow Different Environmental Determinants. PLoS ONE 2016, 11, e0154131. [Google Scholar] [CrossRef]
  22. Efthymiopoulos, S.; Aktas, Y.D.; Altamirano, H. A Critical Review of Analysis Techniques for the Assessment of the Indoor Fungal Burden; ScienceOpen: Berlin, Germany, 2023. [Google Scholar]
  23. Rittenour, W.R.; Ciaccio, C.E.; Barnes, C.S.; Kashon, M.L.; Lemons, A.R.; Beezhold, D.H.; Green, B.J. Internal Transcribed Spacer rRNA Gene Sequencing Analysis of Fungal Diversity in Kansas City Indoor Environments. Environ. Sci. Process. Impacts 2013, 16, 33–43. [Google Scholar] [CrossRef]
  24. Green, B.J.; Lemons, A.R.; Park, Y.; Cox-Ganser, J.M.; Park, J.-H. Assessment of Fungal Diversity in a Water-Damaged Office Building. J. Occup. Environ. Hyg. 2017, 14, 285–293. [Google Scholar] [CrossRef] [PubMed]
  25. WHO Guidelines for Indoor Air Quality: Dampness and Mould. Available online: https://www.who.int/publications/i/item/9789289041683 (accessed on 1 November 2025).
  26. U.S. Environmental Protection Agency. Biological Pollutants’ Impact on Indoor Air Quality. Available online: https://www.epa.gov/indoor-air-quality-iaq/biological-pollutants-impact-indoor-air-quality (accessed on 1 November 2025).
  27. Okinaga, T.; Masago, A.; Nambu, T.; Mashimo, C.; Okinaga, T. Amplicon Sequence Variant-Based Oral Microbiome Analysis Using QIIME 2. J. Osaka Dent. Univ. 2020, 54, 273–281. [Google Scholar] [CrossRef]
  28. Thompson, L.R.; Anderson, S.R.; Uyl, P.A.D.; Patin, N.V.; Lim, S.J.; Sanderson, G.; Goodwin, K.D. Tourmaline: A Containerized Workflow for Rapid and Iterable Amplicon Sequence Analysis Using QIIME 2 and Snakemake. GigaScience 2022, 11, giac066. [Google Scholar] [CrossRef] [PubMed]
  29. Taylor, D.L.; Houston, S. A Bioinformatics Pipeline for Sequence-Based Analyses of Fungal Biodiversity. In Fungal Genomics: Methods and Protocols; Xu, J.-R., Bluhm, B.H., Eds.; Humana Press: Totowa, NJ, USA, 2011; pp. 141–155. ISBN 978-1-61779-040-9. [Google Scholar]
  30. Kõljalg, U.; Nilsson, R.H.; Abarenkov, K.; Tedersoo, L.; Taylor, A.F.S.; Bahram, M.; Bates, S.T.; Bruns, T.D.; Bengtsson-Palme, J.; Callaghan, T.M.; et al. Towards a Unified Paradigm for Sequence-Based Identification of Fungi. Mol. Ecol. 2013, 22, 5271–5277. [Google Scholar] [CrossRef]
  31. Blaalid, R.; Kumar, S.; Nilsson, R.H.; Abarenkov, K.; Kirk, P.M.; Kauserud, H. ITS1 Versus ITS2 as DNA Metabarcodes for Fungi. Mol. Ecol. Resour. 2013, 13, 218–224. [Google Scholar] [CrossRef]
  32. Straub, D.; Blackwell, N.; Langarica-Fuentes, A.; Peltzer, A.; Nahnsen, S.; Kleindienst, S. Interpretations of Environmental Microbial Community Studies Are Biased by the Selected 16S rRNA (Gene) Amplicon Sequencing Pipeline. Front. Microbiol. 2020, 11, 550420. [Google Scholar] [CrossRef] [PubMed]
  33. Colabella, C.; Corte, L.; Roscini, L.; Bassetti, M.; Tascini, C.; Mellor, J.C.; Meyer, W.; Robert, V.; Vu, D.; Cardinali, G. NGS Barcode Sequencing in Taxonomy and Diagnostics, an Application in “Candida” Pathogenic Yeasts with a Metagenomic Perspective. IMA Fungus 2018, 9, 91–105. [Google Scholar] [CrossRef]
  34. Shamim, M.; Kumar, M.; Kumar, R.R.; Pandey, P.; Srivastava, D.; Kumar, D.; Khan, N.A.; Kumar, R.R.; Singh, K.N. Understanding the Diversity of Aspergillus by Next-Generation Sequencing. In New and Future Developments in Microbial Biotechnology and Bioengineering; Gupta, V.K., Ed.; Elsevier: Amsterdam, The Netherlands, 2016; pp. 29–40. ISBN 978-0-444-63505-1. [Google Scholar]
  35. McTaggart, L.R.; Copeland, J.K.; Surendra, A.; Wang, P.W.; Husain, S.; Coburn, B.; Guttman, D.S.; Kus, J.V. Mycobiome Sequencing and Analysis Applied to Fungal Community Profiling of the Lower Respiratory Tract During Fungal Pathogenesis. Front. Microbiol. 2019, 10, 512. [Google Scholar] [CrossRef]
  36. Bolyen, E.; Rideout, J.R.; Dillon, M.R.; Bokulich, N.A.; Abnet, C.C.; Al-Ghalith, G.A.; Alexander, H.; Alm, E.J.; Arumugam, M.; Asnicar, F.; et al. Reproducible, Interactive, Scalable and Extensible Microbiome Data Science Using QIIME 2. Nat. Biotechnol. 2019, 37, 852–857. [Google Scholar] [CrossRef]
  37. Liu, G.; Li, T.; Zhu, X.; Zhang, X.; Wang, J. An Independent Evaluation in a CRC Patient Cohort of Microbiome 16S rRNA Sequence Analysis Methods: OTU Clustering, DADA2, and Deblur. Front. Microbiol. 2023, 14, 1178744. [Google Scholar] [CrossRef]
  38. Rolling, T.; Zhai, B.; Frame, J.V.; Hohl, T.M.; Taur, Y. Customization of a Dada2-Based Pipeline for Fungal Internal Transcribed Spacer 1 (ITS 1) Amplicon Datasets. JCI Insight 2021, 7, e151663. [Google Scholar] [CrossRef] [PubMed]
  39. Nearing, J.T.; Douglas, G.M.; Comeau, A.M.; Langille, M.G.I. Denoising the Denoisers: An Independent Evaluation of Microbiome Sequence Error-Correction Approaches. PeerJ 2018, 6, e5364. [Google Scholar] [CrossRef]
  40. Maki, K.A.; Wolff, B.; Varuzza, L.; Green, S.J.; Barb, J.J. Multi-Amplicon Microbiome Data Analysis Pipelines for Mixed Orientation Sequences Using QIIME2: Assessing Reference Database, Variable Region and Pre-Processing Bias in Classification of Mock Bacterial Community Samples. PLoS ONE 2023, 18, e0280293. [Google Scholar] [CrossRef]
  41. Qin, Y.; Wu, L.; Zhang, Q.; Wen, C.; Van Nostrand, J.D.; Ning, D.; Raskin, L.; Pinto, A.; Zhou, J. Effects of Error, Chimera, Bias, and GC Content on the Accuracy of Amplicon Sequencing. mSystems 2023, 8, e01025-23. [Google Scholar] [CrossRef]
  42. Yeh, Y.-C.; Needham, D.M.; Sieradzki, E.T.; Fuhrman, J.A. Taxon Disappearance from Microbiome Analysis Indicates Need for Mock Communities as a Standard in Every Sequencing Run. mSystems 2017, 3, e00023-18. [Google Scholar] [CrossRef]
  43. Baer, M.; Höppe, L.; Seel, W.; Lipski, A. Impact of DNA Extraction, PCR Amplification, Sequencing, and Bioinformatic Analysis on Food-Associated Mock Communities Using PacBio Long-Read Amplicon Sequencing. BMC Microbiol. 2024, 24, 521. [Google Scholar] [CrossRef]
  44. Bakker, M.G. A Fungal Mock Community Control for Amplicon Sequencing Experiments. Mol. Ecol. Resour. 2018, 18, 541–556. [Google Scholar] [CrossRef] [PubMed]
  45. Palmer, J.M.; Jusino, M.A.; Banik, M.T.; Lindner, D.L. Non-Biological Synthetic Spike-in Controls and the AMPtk Software Pipeline Improve Mycobiome Data. PeerJ 2018, 6, e4925. [Google Scholar] [CrossRef] [PubMed]
  46. Yang, R.-H.; Su, J.-H.; Shang, J.-J.; Wu, Y.-Y.; Li, Y.; Bao, D.-P.; Yao, Y.-J. Evaluation of the Ribosomal DNA Internal Transcribed Spacer (ITS), Specifically ITS1 and ITS2, for the Analysis of Fungal Diversity by Deep Sequencing. PLoS ONE 2018, 13, e0206428. [Google Scholar] [CrossRef]
  47. De Filippis, F.; Laiola, M.; Blaiotta, G.; Ercolini, D. Different Amplicon Targets for Sequencing-Based Studies of Fungal Diversity|Applied and Environmental Microbiology. Appl. Environ. Microbiol. 2017, 83, e00905-17. [Google Scholar] [CrossRef]
  48. Porras-Alfaro, A.; Liu, K.-L.; Kuske, C.R.; Xie, G. From Genus to Phylum: Large-Subunit and Internal Transcribed Spacer rRNA Operon Regions Show Similar Classification Accuracies Influenced by Database Composition. Appl. Environ. Microbiol. 2014, 80, 829–840. [Google Scholar] [CrossRef]
  49. Li, S.; Deng, Y.; Wang, Z.; Zhang, Z.; Kong, X.; Zhou, W.; Yi, Y.; Qu, Y. Exploring the Accuracy of Amplicon-Based Internal Transcribed Spacer Markers for a Fungal Community. Mol. Ecol. Resour. 2020, 20, 170–184. [Google Scholar] [CrossRef]
  50. Abarenkov, K.; Somervuo, P.; Nilsson, R.H.; Kirk, P.M.; Huotari, T.; Abrego, N.; Ovaskainen, O. Protax-Fungi: A Web-Based Tool for Probabilistic Taxonomic Placement of Fungal Internal Transcribed Spacer Sequences. New Phytol. 2018, 220, 517–525. [Google Scholar] [CrossRef]
  51. Nilsson, R.H.; Tedersoo, L.; Ryberg, M.; Kristiansson, E.; Hartmann, M.; Unterseher, M.; Porter, T.M.; Bengtsson-Palme, J.; Walker, D.M.; de Sousa, F.; et al. A Comprehensive, Automatically Updated Fungal ITS Sequence Dataset for Reference-Based Chimera Control in Environmental Sequencing Efforts. Microbes Environ. 2015, 30, 145–150. [Google Scholar] [CrossRef] [PubMed]
  52. Abarenkov, K.; Nilsson, R.H.; Larsson, K.-H.; Taylor, A.F.S.; May, T.W.; Frøslev, T.G.; Pawlowska, J.; Lindahl, B.; Põldmaa, K.; Truong, C.; et al. The UNITE Database for Molecular Identification and Taxonomic Communication of Fungi and Other Eukaryotes: Sequences, Taxa and Classifications Reconsidered. Nucleic Acids Res. 2024, 52, D791–D797. [Google Scholar] [CrossRef] [PubMed]
  53. Prodan, A.; Tremaroli, V.; Brolin, H.; Zwinderman, A.H.; Nieuwdorp, M.; Levin, E. Comparing Bioinformatic Pipelines for Microbial 16S rRNA Amplicon Sequencing. PLoS ONE 2020, 15, e0227434. [Google Scholar] [CrossRef]
  54. Almeida, A.; Mitchell, A.L.; Tarkowska, A.; Finn, R.D. Benchmarking Taxonomic Assignments Based on 16S rRNA Gene Profiling of the Microbiota from Commonly Sampled Environments. Gigascience 2018, 7, giy054. [Google Scholar] [CrossRef]
  55. Bokulich, N.A.; Kaehler, B.D.; Rideout, J.R.; Dillon, M.; Bolyen, E.; Knight, R.; Huttley, G.A.; Gregory Caporaso, J. Optimizing Taxonomic Classification of Marker-Gene Amplicon Sequences with QIIME 2’s q2-Feature-Classifier Plugin. Microbiome 2018, 6, 90. [Google Scholar] [CrossRef]
  56. Chiarello, M.; McCauley, M.; Villéger, S.; Jackson, C.R. Ranking the Biases: The Choice of OTUs vs. ASVs in 16S rRNA Amplicon Data Analysis Has Stronger Effects on Diversity Measures than Rarefaction and OTU Identity Threshold. PLoS ONE 2022, 17, e0264443. [Google Scholar] [CrossRef] [PubMed]
  57. Gweon, H.S.; Oliver, A.; Taylor, J.; Booth, T.; Gibbs, M.; Read, D.S.; Griffiths, R.I.; Schonrogge, K. PIPITS: An Automated Pipeline for Analyses of Fungal Internal Transcribed Spacer Sequences from the Illumina Sequencing Platform. Methods Ecol. Evol. 2015, 6, 973–980. [Google Scholar] [CrossRef] [PubMed]
  58. White, J.R.; Maddox, C.; White, O.; Angiuoli, S.V.; Fricke, W.F. CloVR-ITS: Automated Internal Transcribed Spacer Amplicon Sequence Analysis Pipeline for the Characterization of Fungal Microbiota. Microbiome 2013, 1, 6. [Google Scholar] [CrossRef] [PubMed]
  59. Mlaga, K.D.; Mathieu, A.; Beauparlant, C.J.; Ott, A.; Khodr, A.; Perin, O.; Droit, A. HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis. Front. Microbiol. 2021, 12, 640693. [Google Scholar] [CrossRef] [PubMed]
  60. Nilsson, R.H.; Kristiansson, E.; Ryberg, M.; Hallenberg, N.; Larsson, K.-H. Intraspecific ITS Variability in the Kingdom Fungi as Expressed in the International Sequence Databases and Its Implications for Molecular Species Identification. Evol. Bioinform. Online 2008, 4, EBO.S653. [Google Scholar] [CrossRef]
  61. Verreault, D.; Rousseau, G.M.; Gendron, L.; Massé, D.; Moineau, S.; Duchaine, C. Comparison of Polycarbonate and Polytetrafluoroethylene Filters for Sampling of Airborne Bacteriophages. Aerosol Sci. Technol. 2010, 44, 197–201. [Google Scholar] [CrossRef]
  62. Stackebrandt, E.; Goebel, B.M. Taxonomic Note: A Place for DNA-DNA Reassociation and 16S rRNA Sequence Analysis in the Present Species Definition in Bacteriology. Int. J. Syst. Evol. Microbiol. 1994, 44, 846–849. [Google Scholar] [CrossRef]
  63. Moossavi, S.; Atakora, F.; Fehr, K.; Khafipour, E. Biological Observations in Microbiota Analysis Are Robust to the Choice of 16S rRNA Gene Sequencing Processing Algorithm: Case Study on Human Milk Microbiota. BMC Microbiol. 2020, 20, 290. [Google Scholar] [CrossRef]
  64. Kameoka, S.; Motooka, D.; Watanabe, S.; Kubo, R.; Jung, N.; Midorikawa, Y.; Shinozaki, N.O.; Sawai, Y.; Takeda, A.K.; Nakamura, S. Benchmark of 16S rRNA Gene Amplicon Sequencing Using Japanese gut Microbiome Data from the V1–V2 and V3–V4 Primer Sets. BMC Genom. 2021, 22, 527. [Google Scholar] [CrossRef]
  65. Theresa, R.; Praeg, N.; Galla, G.; Seeber, J.; Hauffe, H.C.; Illmer, P. Comparison of Commonly Used Software Pipelines for Analyzing Fungal Metabarcoding Data. BMC Genom. 2024, 25, 1085. [Google Scholar] [CrossRef]
  66. Fares, M.; Tharwat, E.K.; Cleenwerck, I.; Monsieurs, P.; Van Houdt, R.; Vandamme, P.; El-Hadidi, M.; Mysara, M. The Unresolved Struggle of 16S rRNA Amplicon Sequencing: A Benchmarking Analysis of Clustering and Denoising Methods. Environ. Microbiome 2025, 20, 51. [Google Scholar] [CrossRef]
  67. Pauvert, C.; Buée, M.; Laval, V.; Edel-Hermann, V.; Fauchery, L.; Gautier, A.; Lesur, I.; Vallance, J.; Vacher, C. Bioinformatics Matters: The Accuracy of Plant and Soil Fungal Community Data is Highly Dependent on the Metabarcoding Pipeline. Fungal Ecol. 2019, 41, 23–33. [Google Scholar] [CrossRef]
  68. Joos, L.; Beirinckx, S.; Haegeman, A.; Debode, J.; Vandecasteele, B.; Baeyen, S.; Goormachtig, S.; Clement, L.; De Tender, C. Daring to Be Differential: Metabarcoding Analysis of Soil and Plant-Related Microbial Communities Using Amplicon Sequence Variants and Operational Taxonomical Units. BMC Genom. 2020, 21, 733. [Google Scholar] [CrossRef]
  69. Watanabe, K.; Yanagi, U.; Shiraishi, Y.; Harada, K.; Ogino, F.; Asano, K. Bacterial Communities in Various Parts of Air-Conditioning Units in 17 Japanese Houses. Microorganisms 2022, 10, 2246. [Google Scholar] [CrossRef] [PubMed]
  70. Mohsen, A.; Park, J.; Chen, Y.-A.; Kawashima, H.; Mizuguchi, K. Impact of Quality Trimming on the Efficiency of Reads Joining and Diversity Analysis of Illumina Paired-End Reads in the Context of QIIME1 and QIIME2 Microbiome Analysis Frameworks. BMC Bioinform. 2019, 20, 581. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The quartile values of the observed species index (Blue: QIIME1; Red: QIIME2). *: p < 0.05, **: p < 0.001.
Figure 1. The quartile values of the observed species index (Blue: QIIME1; Red: QIIME2). *: p < 0.05, **: p < 0.001.
Microorganisms 13 02545 g001
Figure 2. The quartile values of the PD whole tree index (Blue: QIIME1; Red: QIIME2). *: p < 0.05, **: p < 0.001.
Figure 2. The quartile values of the PD whole tree index (Blue: QIIME1; Red: QIIME2). *: p < 0.05, **: p < 0.001.
Microorganisms 13 02545 g002
Figure 3. The quartile values of the Shannon index (Blue: QIIME1; Red: QIIME2). N.S.: Not Significant.
Figure 3. The quartile values of the Shannon index (Blue: QIIME1; Red: QIIME2). N.S.: Not Significant.
Microorganisms 13 02545 g003
Figure 4. The quartile values of Weighted UniFrac (Blue: QIIME1; Red: QIIME2). **: p < 0.01, and ***: p < 0.001.
Figure 4. The quartile values of Weighted UniFrac (Blue: QIIME1; Red: QIIME2). **: p < 0.01, and ***: p < 0.001.
Microorganisms 13 02545 g004
Figure 5. The quartile values of Unweighted UniFrac (Blue: QIIME1; Red: QIIME2). *: p < 0.05, and ***: p < 0.001, N.S.: Not Significant.
Figure 5. The quartile values of Unweighted UniFrac (Blue: QIIME1; Red: QIIME2). *: p < 0.05, and ***: p < 0.001, N.S.: Not Significant.
Microorganisms 13 02545 g005
Figure 6. The quartile values of Bray–Curtis distance (Blue: QIIME1; Red: QIIME2). *: p < 0.05, and ***: p < 0.001, N.S.: Not Significant.
Figure 6. The quartile values of Bray–Curtis distance (Blue: QIIME1; Red: QIIME2). *: p < 0.05, and ***: p < 0.001, N.S.: Not Significant.
Microorganisms 13 02545 g006
Figure 7. The quartile values of Jaccard distance (Blue: QIIME1; Red: QIIME2). *: p < 0.05, **: p < 0.01, and ***: p < 0.001, N.S.: Not Significant.
Figure 7. The quartile values of Jaccard distance (Blue: QIIME1; Red: QIIME2). *: p < 0.05, **: p < 0.01, and ***: p < 0.001, N.S.: Not Significant.
Microorganisms 13 02545 g007
Figure 8. Venn diagram of genera detected across different pipelines.
Figure 8. Venn diagram of genera detected across different pipelines.
Microorganisms 13 02545 g008
Figure 9. Venn diagram of genera detected with a relative abundance of 1% or higher across different pipelines.
Figure 9. Venn diagram of genera detected with a relative abundance of 1% or higher across different pipelines.
Microorganisms 13 02545 g009
Figure 10. Top 20 fungal genera detected using different pipelines and the results of significance tests between the pipelines. (Ag_g_I_s: Agaricomycetes_gen_Incertae_sedis, Mo_g_I_s: Mortierellaceae_gen_Incertae_sedis, Sa_g_I_s: Saccharomycetales_gen_Incertae_sedis, Po_g_I_s: Polyporaceae_gen_Incertae_sedis, Te_g_I_s: Teratosphaeriaceae_gen_Incertae_sedis, As_g_I_s: Aspergillaceae_gen_Incertae_sedis, Fu_g_I_s: Fungi_gen_Incertae_sedis, Pl_g_I_s: Pleosporales_gen_Incertae_sedis, Au_g_I_s: Auriculariales_gen_Incertae_sedis, Rh_ri: Rhodosporidiobolus, Asc_g_I_s: Ascomycota_gen_Incertae_sedis, Or_g_I_s: Orbiliaceae_gen_Incertae_sedis, and Wa_g_I_s: Wallemiaceae_gen_Incertae_sedis).
Figure 10. Top 20 fungal genera detected using different pipelines and the results of significance tests between the pipelines. (Ag_g_I_s: Agaricomycetes_gen_Incertae_sedis, Mo_g_I_s: Mortierellaceae_gen_Incertae_sedis, Sa_g_I_s: Saccharomycetales_gen_Incertae_sedis, Po_g_I_s: Polyporaceae_gen_Incertae_sedis, Te_g_I_s: Teratosphaeriaceae_gen_Incertae_sedis, As_g_I_s: Aspergillaceae_gen_Incertae_sedis, Fu_g_I_s: Fungi_gen_Incertae_sedis, Pl_g_I_s: Pleosporales_gen_Incertae_sedis, Au_g_I_s: Auriculariales_gen_Incertae_sedis, Rh_ri: Rhodosporidiobolus, Asc_g_I_s: Ascomycota_gen_Incertae_sedis, Or_g_I_s: Orbiliaceae_gen_Incertae_sedis, and Wa_g_I_s: Wallemiaceae_gen_Incertae_sedis).
Microorganisms 13 02545 g010aMicroorganisms 13 02545 g010b
Figure 11. (a) Correlation of Libraries in QIIME1 and QIIME2. (b) Correlation of OTUs/ASVs in QIIME1 and QIIME2.
Figure 11. (a) Correlation of Libraries in QIIME1 and QIIME2. (b) Correlation of OTUs/ASVs in QIIME1 and QIIME2.
Microorganisms 13 02545 g011
Table 1. Summary of measurement targets.
Table 1. Summary of measurement targets.
Sample TypeSample IDNumber of SamplesYearWayTime Until Sequence (Month)
outdoors in summerOutside_S5 (Tokyo), 3 (Aichi prefecture)2019Air sample
3 L/min (60 min)
3
outdoors in winterOutside_W5 (Tokyo), 3 (Aichi prefecture)2020Air sample
3 L/min (60 min)
8
office in summerOffice_S5 (Tokyo), 3 (Aichi prefecture)2019Air sample
3 L/min (60 min)
3
office in winterOffice_W5 (Tokyo), 3 (Aichi prefecture)2020Air sample
3 L/min (60 min)
8
movie theaterTheater182023Swab
100 cm2
3
air conditioner filterAC_Filter172021Swab
50 cm2
4
air conditioner coilAC_Coil172021Swab
50 cm2
4
Table 2. Schematic diagram comparing the two pipelines.
Table 2. Schematic diagram comparing the two pipelines.
PipelineQIIME1QIIME2
Primer trimmingcutadaptqiime cutadapt trim-paired
--errors0.2--p-error-rate 0.2
--trimmed-onlyTRUE--p-discard-untrimmedTRUE
Quality filteringcutadapt dada2 --p-max-ee-f/
--p-max-ee-r
--quality-cutoff20
--minimum-length2
Pooling modeNGmergeqiime dada2 denoise-paired
-m: Minimum overlap of the paired-end reads20--p-trunc-len-f190
-e: Minimum overlap of dovetailed alignments100--p-trunc-len-r180
--p-trim-left-f0
--p-trim-left-r0
--p-max-ee-f2
--p-max-ee-r2
--p-pooling-methoddefault ‘independent’
Chimera handlingusearch--p-chimera-method default ‘consensus’
-strandplus
Classifier trainingpick_de_novo_otus.pyqiime feature-classifier classify-sklearn
default default
Rarefaction depthalpha_rarefaction.pyqiime diversity alpha-rarefaction
--parameter_fpMinimum lead count--p-sampling-depthMinimum lead count
beta_diversity_through_plots.pyqiime diversity core-metrics-phylogenetic
default --p-max-depthMaximum number of leads
Table 3. Output data after processing sequence data with different pipelines.
Table 3. Output data after processing sequence data with different pipelines.
Outside_SOutside_WOffice_SOffice_WTheaterAC_FilterAC_Coil
QIIME1QIIME2QIIME1QIIME2QIIME1QIIME2QIIME1QIIME2QIIME1QIIME2QIIME1QIIME2QIIME1QIIME2
Raw130,977 ± 18,07676,097 ± 8832125,850 ± 20,76276,714 ± 10,17996,637 ± 18,26028,824 ± 722927,349 ± 5948
Filtered130,527 ± 18,037127,603 ± 18,53175,499 ± 875974,641 ± 8592125,412 ± 20,590122,370 ± 21,14176,100 ± 10,14574,525 ± 976195,644 ± 18,01794,877 ± 17,81228,710 ± 720328,634 ± 719027,241 ± 593627,121 ± 5913
Merged108,774 ± 17,679106,199 ± 21,04269,738 ± 828360,153 ± 11,767107,636 ± 15,354106,290 ± 20,53869,894 ± 942859,943 ± 11,88487,468 ± 15,49584,744 ± 14,30027,410 ± 693325,586 ± 648825,895 ± 565824,088 ± 5848
Mean OUT/ASV108,774 ± 17,679104,562 ± 21,26269,738 ± 828359,925 ± 11,608107,636 ± 15,354105,564 ± 20,46369,894 ± 942859,498 ± 11,84887,468 ± 15,49584,313 ± 14,06727,410 ± 693325,453 ± 639625,895 ± 565823,924 ± 5800
Minimum Library74,76261,41159,93738,56389,19481,40157,01336,25952,51153,99515,78215,90018,87015,684
Minimum Library/Raw0.750.620.900.500.810.760.900.420.870.790.920.800.930.77
Maximum Library130,655124,43086,03476,782138,066143,53487,20780,318113,400110,93747,23344,32739,29536,755
Maximum Library/Raw0.870.870.930.880.890.900.930.880.930.940.960.970.970.95
Mean OUTs/ASVs1371 ± 35382 ± 73706 ± 165211 ± 1161195 ± 28058 ± 14707 ± 203158 ± 18751 ± 24977 ± 71620 ± 447243 ± 162311 ± 72104 ± 48
Minimum OUTs/ASVs8572856412496238466118317241994418446
Maximum OUTs/ASVs21032711107505183778120517913593211805555459201
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Watanabe, K.; Yanagi, U. Comparison of QIIME1 and QIIME2 for Analyzing Fungal Samples from Various Built Environments. Microorganisms 2025, 13, 2545. https://doi.org/10.3390/microorganisms13112545

AMA Style

Watanabe K, Yanagi U. Comparison of QIIME1 and QIIME2 for Analyzing Fungal Samples from Various Built Environments. Microorganisms. 2025; 13(11):2545. https://doi.org/10.3390/microorganisms13112545

Chicago/Turabian Style

Watanabe, Kensuke, and U Yanagi. 2025. "Comparison of QIIME1 and QIIME2 for Analyzing Fungal Samples from Various Built Environments" Microorganisms 13, no. 11: 2545. https://doi.org/10.3390/microorganisms13112545

APA Style

Watanabe, K., & Yanagi, U. (2025). Comparison of QIIME1 and QIIME2 for Analyzing Fungal Samples from Various Built Environments. Microorganisms, 13(11), 2545. https://doi.org/10.3390/microorganisms13112545

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop