Next Article in Journal
Emerging Multimodal Strategies for Bacterial Biofilm Eradication: A Comprehensive Review
Previous Article in Journal
Metabolomic Analysis of Antifungal Secondary Metabolites from Achaetomium sophora HY17 in Co-Culture with Botrytis cinerea HM1
Previous Article in Special Issue
The Influence of Acetate and Sodium Chloride Concentration on the Toxic Response of Electroactive Microorganisms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Comprehensive Analysis of Microbial Community and Nitrogen Removal Rate Predictions in Three Anammox Systems

Chinese Research Academy of Environmental Sciences, Beijing 100012, China
*
Author to whom correspondence should be addressed.
Microorganisms 2025, 13(12), 2795; https://doi.org/10.3390/microorganisms13122795
Submission received: 23 October 2025 / Revised: 28 November 2025 / Accepted: 4 December 2025 / Published: 8 December 2025

Abstract

Anammox is a promising approach for biological nitrogen removal, but the differences in microbial community structure across different systems and their response mechanisms to environmental factors remain unclear. In this study, 206 microbial samples and 2126 environmental factor data points from three different anammox systems, including the upflow anaerobic sludge blanket (UASB), integrated fixed-film activated sludge-partial nitritation/anammox (IFAS-PN/A), and integrated fixed-film activated sludge-simultaneous nitrification, anammox and denitrification (IFAS-SNAD), were analyzed using 16S rRNA sequencing analysis, bioinformatics, and machine learning (ML) techniques. The results revealed significant differences in microbial composition among three systems, evidenced by the enrichment of Candidatus_Brocadia in IFAS-PN/A, the high-diversity community in IFAS-SNAD, and the low-diversity communities dominated by Candidatus_Kuenenia in the UASB. Co-occurrence network analysis demonstrated more tightly connected and complex interactions in IFAS-SNAD networks. Machine learning predictions further showed that the stacked model (ST-RF) achieved the highest accuracy in predicting the nitrogen removal rate (NRR), with determination coefficients (R2) exceeding 0.987 across all testing datasets. Moreover, SHapley Additive exPlanations (SHAP) analysis based on the stacked model revealed that the influence of key environmental factors on NRR varied by system type. These results suggested that NRR of different systems depended on the control of key environmental factors, while the significance of these environmental factors was determined by the type of system. Overall, this study enhanced the ecological and functional understanding of anammox-based processes and provided a data-driven framework for optimizing mainstream nitrogen removal.

1. Introduction

Anaerobic ammonia oxidation (anammox) process has emerged as an effective and environmentally sustainable approach for nitrogen removal [1,2,3]. This process is primarily driven by anammox bacteria (AnAOB), which convert ammonium nitrogen (NH4+-N) directly into nitrogen gas (N2) by employing nitrite nitrogen (NO2-N) as the electron acceptor, thereby eliminating the need for aeration or external organic carbon [4,5]. However, AnAOB has been observed to exhibit a relatively slow growth rate, with a doubling time of 10–12 days under optimal conditions [6,7]. And it is known to be sensitive to changes in environmental conditions, including dissolved oxygen (DO), organic carbon levels, ammonium concentrations, and temperature. This sensitivity renders it challenging for AnAOB to be retained and enriched within treatment systems [8]. Additionally, AnAOB competes with rapidly growing nitrite-oxidizing bacteria (NOB) for both substrate and space [9]. These challenges are significant barriers to the implementation of anammox technology in wastewater treatment facilities.
To overcome these barriers, various reactor configurations have been designed, including the upflow anaerobic sludge blanket (UASB), integrated fixed-film activated sludge-partial nitritation/anammox (IFAS-PN/A), and integrated fixed-film activated sludge-simultaneous nitrification, anammox, and denitrification (IFAS-SNAD) systems. Each configuration creates distinct ecological conditions that influence microbial enrichment and nitrogen transformation pathways. Among these, the IFAS-PN/A process enhances nitrogen removal by selectively retaining AnAOB in biofilms while suppressing NOB through floc discharge [10,11]. Yang et al. [12] established IFAS-PN/A reactor with a stable nitrogen removal rate (NRR) of 2.0–2.5 kg N/(m3·d), with 85% of nitrogen removal efficiency (NRE), attributing this performance to the combined enrichment of ammonium-oxidizing bacteria (AOB) and AnAOB in both flocs and biofilms. Further, Yang et al. [13] demonstrated that IFAS-PN/A system enhanced both the abundance and metabolic activity of functional microorganisms, effectively limiting nitrate accumulation and attaining a maximum NRR of 1.5 kg N/(m3·d) with an NRE of 82.3%, in which Candidatus_Brocadia was identified as the dominant anammox genus. Building upon the SNAD process, the IFAS-SNAD process enhanced nitrogen removal through the synergistic action of simultaneous nitrification, anammox, and denitrification. Recent studies have clarified the underlying mechanisms of this synergy. For instance, Wang et al. [14] and Du et al. [15] demonstrated that AnAOB were predominantly enriched in the biofilm, working synergistically with heterotrophic denitrifiers and AOB in the suspended sludge. This partitioning enabled high total nitrogen (TN) removal efficiencies (72–92.8%) at optimized operational conditions, showcasing the highly efficient nitrogen removal performance of the IFAS-SNAD system. In contrast to the two IFAS systems, UASB reactor is extensively applied for conducting the anammox process because its sludge readily forms dense granules, facilitating biomass retention and typically resulting in a higher NRR [16,17,18,19]. This system excels at fostering a dominant anammox community, as evidenced by Li et al. [20], where Candidatus_Kuenenia proliferated from 0.01% to over 50% abundance, enabling exceptional nitrogen removal efficiencies exceeding 99.6%. Although the performance of anammox sludge formed in these systems has been studied, a systematic comparison of microbial community structures across these systems remains limited.
Predicting nitrogen removal performance in biological processes is difficult because of the interactions between microbial communities and surrounding environmental conditions. Machine learning (ML), recognized as an advanced technique within data mining, has captured considerable interest recently. Previous studies have primarily applied single models to predict nitrogen removal [21,22,23]. For example, a meta-analysis across wastewater treatment plants worldwide used six ML models to predict nitrogen removal, and found that Extreme Gradient Boosting (XGBoost) yielded the highest R2 among them, exceeding 0.8 [24]. Another study by Mu et al. [25] showed that while Artificial Neural Network (ANN) and XGBoost demonstrated strong predictive capabilities for NRR and functional microbial abundance, respectively, Support Vector Regression (SVR) performed less effectively, because it struggled to represent intricate nonlinear patterns and was prone to interference from redundant variables. However, single models may be overly sensitive to characteristics and noise in the training data, leading to overfitting issues. Stacked models, which integrate multiple base learners, are regarded as a promising approach by mitigating overfitting and enhancing predictive stability and accuracy [26]. Over the last decade, model stacking has found widespread application across various domains [27,28,29]. Research by Lukas et al. has demonstrated that stacked models exhibit greater stability in small datasets compared to individual ML models [30]. Nonetheless, the application of a stacked model to simulate nitrogen removal process has been relatively uncommon.
Therefore, this study aims to systematically compare the microbial community structures and microbial networks across three anammox systems (IFAS-PN/A, IFAS-SNAD, and UASB) using 16S rRNA gene sequences based on big-data mining, while a stacked model was employed to predict NRR and interpret the key influencing environmental factors through SHapley Additive exPlanations (SHAP). By integrating multiple analytical approaches and machine learning techniques for high throughput sequencing interpretation, this study enhances the precision and efficiency of NRR prediction and offers valuable insights to optimize nitrogen removal processes in wastewater treatment systems.

2. Materials and Methods

2.1. Collection and Processing of Literature Data

Studies published between 2016 and 2025 were retrieved from Web of Science using keyword combinations of “anammox” with “UASB,” “integrated fixed-film activated sludge (IFAS),” “partial nitritation/anammox (PN/A),” or “simultaneous nitrification, anammox, and denitrification (SNAD).” Only Illumina-based 16S rRNA amplicon datasets with publicly available FASTQ files, complete metadata, and clearly reported primers were included. A screening diagram is provided in Supplementary Figure S1. Comprehensive metadata and accession numbers associated with sequencing datasets collected from National Center for Biotechnology Information (NCBI) Sequence Read Archive are listed in Supplementary Table S1 [31]. A total of 206 samples were collected, including IFAS-PN/A (55 samples), IFAS-SNAD (86 samples), and UASB (65 samples).
Machine learning approaches were additionally utilized in this study to determine which environmental factors exerted significant effects on NRR. In total, 2126 environmental factor data points from the figures of the aforementioned studies were collected in the three systems using Origin software (version 2022) (see Supplementary Materials), including 584 from the IFAS-PN/A system, 420 from the IFAS-SNAD system, and 1122 from the UASB system. Environmental factor data were collected for different reaction systems. At the same time, the corresponding operational variables were sorted out. To minimize the impact of random errors on the extracted data, three extractions were performed at each sampling point, with the final average value used as the experimental analysis data. In this study, the nitrogen loading rate (NLR) and NRR were determined based on the formulas presented in Equations (1) to (4).
T N i n f m g / L = N H 4 + N i n f + N O 2 N i n f + N O 3 N i n f
T N e f f m g / L = N H 4 + N e f f + N O 2 N e f f + N O 3 N e f f
N L R kgN m 3 d = T N inf 1000 × 24 HRT
N R R kgN m 3 d = T N inf T N e ff 1000 × 24 HRT
where NH4+-Ninf, NO2-Ninf, NO3-Ninf, NH4+-Neff, NO2-Neff, and NO3-Neff are the nitrogen concentrations in mg/L of the influent and effluent, respectively.

2.2. Bioinformatics Analysis

All 16S rRNA gene sequences collected from the corresponding literature were grouped by primer type. Each group independently applied Cutadapt v.5.0 [32] and DADA2 plugin [33] to finish removing primers and quality control in QIIME 2 (version 2025.4.0) [34]. Quality control thresholds were enforced during this process. Sequences were filtered using a minimum Phred quality score of 33. Paired-end reads were retained only if the merging success rate exceeded 90%. After processing, representative sequences and feature tables from different groups were merged using the merge-seqs and merge functions of the QIIME 2 plugin. The resulting consolidated dataset was then subjected to taxonomic classification of each amplicon sequence variant (ASV) using the Silva 138 Classifier [35]. Bacterial composition profiles at genus level were subsequently compiled for integrated statistical analyses and visualization. The finalized ASV table formed the basis for later machine learning modeling development.

2.3. Diversity Analysis, Network Analysis and Data Visualization

Alpha diversity across the distinct treatment systems was evaluated in R (version 4.5.0) using the vegan v2.6-8, ggplot2 v3.5.1, and agricolae v1.3-7 packages to visualize and compare species richness and Simpson Diversity index. Statistical significance among groups was determined through the Mann–Whitney U test. Community structure differentiation was further examined via Principal Coordinates Analysis (PCoA) and Adonis tests based on Bray–Curtis dissimilarities, both executed with the vegan package.
Co-occurrence networks were generated in R (version 4.5.0) utilizing the igraph v2.1.4, Hmisc v5.2.3, and ggplot2 v3.5.1 packages. To eliminate low-frequency ASVs, only those representing more than 0.01% of the total sequences and appearing in over 20% of samples were retained. Pairwise Spearman’s correlations between ASVs were calculated, with a correlation coefficient > |0.6| and a p value < 0.01 (Benjamini and Hochberg adjusted) being considered as a valid relationship [36]. The network-level (average clustering coefficient, modularity, density, and average degree) and node-level (degree, nodes, and edges) topological features of a network were calculated. These networks were visualized using Gephi (version 0.10.1) and were accordingly named IFAS-PN/A, IFAS-SNAD, and UASB.

2.4. Machine Learning Model

Stacking is a specific type of ensemble machine learning algorithm that can produce a strong learner from poorer ones [37]. To enhance overall performance, this hierarchical approach employs a meta model that synthesizes the outcomes of base models, thereby improving the accuracy and generalization capability of the model. The detailed architecture of the model is shown in Figure 1. The dataset collected was split into training and test sets. Base learners were trained on the training set and their results on the training set served as inputs to a meta learner, which was used to train the final meta learner. After model training, the testing set served as inputs to the meta learner, which generated the final labels. Model performance and hyperparameters were optimized using grid search with five-fold cross-validation, ensuring high accuracy and robustness in practical applications.
In the study, datasets for IFAS-PN/A (n = 584), IFAS-SNAD (n = 420), and UASB (n = 1122) were each divided into training and testing sets at a ratio of 7:3, adhering to the no-peeking principle to prevent data leakage [38,39]. Categorical Boosting (CatBoost), Adaptive Boosting (AdaBoost), Light Gradient Boosting Machine (LGBM), Gradient Boosting Machine (GBM), and SVR were used as base learners, while XGBoost and Random Forest (RF) were served as meta learners to generate two stacked models, namely Stacked-XGBoost (ST-XG) and Stacked-RF (ST-RF). Model performance was evaluated using determination coefficients (R2), root mean square error (RMSE), and mean absolute error (MAE) according to Equations (5) to (7), with hyperparameters optimized via five-fold cross-validation and grid search [40,41]. Regularization was applied to all base learners, and outlier problems were implicitly addressed through robust model design and cross-validation.
R 2 = 1 i = 1 N p i p i ^ 2 i = 1 N p i p ¯ 2
R M S E = 1 N i = 1 N p i p i ^ 2
M A E = 1 N i = 1 N p i p i ^
where p i ^ is the predicted NRR value, p i is the actual NRR value, and N is the number of sampling points.

2.5. Feature Importance Analysis and Data Visualization

To interpret the contributions of individual environmental factors in the stacked model, SHAP v.0.48.0 was employed. For each base learner in the first layer, SHAP values were computed using TreeExplainer for tree-based models (LGBM, GBM, AdaBoost, and CatBoost) and KernelExplainer for non-tree models (SVR). In the second layer, the meta learner was explained by computing SHAP values for the meta-features, which are the predictions generated by the base learners, thereby quantifying each base learner’s contribution to the final prediction. Finally, global SHAP values were calculated to assess the overall influence of original input features on the stacked model’s output. The data were visualized using the matplotlib module (version 3.10.6) in Python (version 3.13.7).

3. Results and Discussion

3.1. Distinct Microbial Community Diversity and Structures in Three Anammox Systems

Microbial community composition across three distinct anammox systems was investigated using a total of 206 16S rRNA gene sequencing datasets obtained from 19 previous studies.
As shown in Figure 2a, the Mann–Whitney U test revealed significant differences in the Simpson Diversity index in three systems (p < 0.0001). Among them, the IFAS-SNAD system exhibited the highest average Simpson Diversity index of 0.96, suggesting a greater diversity in this system’s microbial community. In addition, the IFAS-PN/A and UASB systems showed a lower average Simpson Diversity index of 0.86 and 0.82, respectively. In addition, the Sobs index provided a more intuitive representation of the differences in microbial richness across the three systems, and the IFAS-PN/A and UASB systems had lower microbial richness with significant differences compared to IFAS-SNAD (p < 0.0001). These findings illustrated that higher microbial diversity as well as richness were possessed by the IFAS-SNAD microbial community compared with IFAS-PN/A and UASB systems. In addition, principal coordinates analysis (PCoA) showed the difference in microbial community composition among three systems at the phylum and genus levels (Figure 2b,c). The Adonis test revealed that the differences in microbial community structure were particularly significant at two levels (p = 0.001), with more significant differences observed at the genus level.

3.2. Differential Abundance of Taxonomic Compositions

A total of 825, 1201, and 931 genera were detected in IFAS-PN/A, IFAS-SNAD, and UASB systems, respectively (Figure 3a). IFAS-SNAD system exhibited the highest number of unique bacterial genera (354), while UASB system had the lowest (115). Additionally, 575 genera were found to be common across all three systems, suggesting their widespread presence within these systems. Only a limited number of genera overlapped between different systems, highlighting the distinctive microbial community structures characteristic of each treatment process.
Furthermore, the top ten phyla and the top ten genera with the highest relative abundance in three different systems were presented in Figure 3b,c. At the phylum level, microbial communities were primarily composed of Proteobacteria (28.1–29.4%), Chloroflexi (16.8–28.4%), Bacteroidota (9.3–20.3%), and Planctomycetota (7.5–24.6%), which collectively accounted for more than 74.1% of total abundance. Among these, the Proteobacteria with denitrification and nitrification capabilities and the Bacteroidetes with the ability to degrade complex organic matter exhibited the highest relative abundances in the IFAS-SNAD system [42,43], at 29.4% and 20.3%, respectively. The high richness of these two phyla was consistent with the SNAD process integrating nitrification, anammox, and denitrification [44]. Specifically, the increased richness of these phyla was attributed to the presence of trace DO and more complex organic matter environments, enabling functions such as simultaneous denitrification within the IFAS-SNAD system [45]. Chloroflexi had been found to exhibit high relative abundance in the three systems. The higher abundance of Chloroflexi could be attributed to their role in scavenging macromolecules derived from dead anammox biomass and these filamentous bacteria, thereby maintaining and strengthening the structural integrity of anammox consortia [46,47,48]. Planctomycetota, which contains AnAOB, was found to account for 10.2%, 7.5%, and 24.6% in IFAS-PN/A, IFAS-SNAD, and UASB systems, respectively. Notably, Planctomycetota was significantly more abundant in the UASB system (24.6%) compared to the other two systems, reflecting that nitrogen removal of anammox process accounted for a larger proportion in the UASB system.
At the genus level, notable differences were observed in the composition and abundance of AnAOB in the three systems. In IFAS-PN/A system, Candidatus_Brocadia (5.0%) was the most abundant, while the UASB system was dominated by Candidatus_Kuenenia (19.0%). However, the relative abundance of Candidatus_Brocadia (1.2%) and Candidatus_Kuenenia (0.4%) in IFAS-SNAD system was lower compared to the other two systems. It was attributed to simultaneous occurrence of nitrification, anammox, and denitrification in the IFAS-SNAD system, which promoted the growth of bacteria such as nitrifying bacteria and denitrifying bacteria, leading to a reduced relative abundance of AnAOB. SBR1031 and Denitratisoma were notably abundant in both UASB and IFAS-PN/A systems. Previous studies showed that SBR1031 participated in organic matter degradation [49], while Denitratisoma, a well-defined denitrifier, cooperated with AnAOB for nitrogen removal in anammox systems [50], indicating their key roles in both systems. The unique enrichment of PHOS-HE36 in the IFAS-SNAD system, accounting for approximately 4.53%, was noteworthy. It has been speculated that these obligate heterotrophic bacteria might be involved in the denitrification process [51]. And the relative abundance of Nitrospira (NOB) was also higher than in the other two systems. This observation could be explained by the evolutionary background of Nitrospira, which originated from microaerophilic or anaerobic ancestors, and these organisms possess the ability to utilize simple organic molecules, enabling a mixotrophic mode of growth when substrate availability is restricted [52]. Furthermore, the high abundance of uncultured genus across all systems revealed the potential diversity yet to be functionally characterized [53].

3.3. Microbial Network Analysis

3.3.1. Network Topological Features

Network analysis was conducted to investigate the distinct interactions within three anammox systems (Figure 4). The network topological features of the three systems were provided in Figure 4a. The resulting networks contained 157–333 nodes and 1010–6104 edges with the average degree (avgK) from 12.866 to 36.661. The avgK, representing the average number of connections per node, was highest in the IFAS-SNAD network (36.661), while lowest in the UASB network (12.866). Both the average clustering coefficient (avgCC) and density, which measure network compactness, were lowest in the UASB system and highest in the IFAS-SNAD system. In addition, the modularity, which indicates the strength of division of a network into modules, was the highest in the UASB network (0.495), while the lowest was in the IFAS-SNAD network (0.391). These observations suggested that the IFAS-SNAD system possessed a more tightly connected and complex symbiotic network. In contrast, the UASB system showed a simpler and more modular network. These differences in features directly reflected the variations in microbial interaction among the three systems.

3.3.2. Network Structure in the Three Systems

The constructed three microbial networks were visualized in Figure 4c, offering insights into the different network structure across three systems. The majority of network nodes were taxonomically classified within seven primary phyla, including Proteobacteria (26.64–30.03%), Bacteroidota (10.51–16.56%), Chloroflexi (8.71–12.74%), and Planctomycetota (4.46–8.71%), among others. The proportion of individual phyla represented in the networks varied notably from their relative abundances in the corresponding microbial communities. In particular, nodes belonging to Planctomycetota in the UASB networks only accounted for 4.46%, a marked contrast to their relative abundance (24.6%) in the microbial community. While the proportion of nodes related to Proteobacteria remained consistent at approximately 45.8% across three networks, suggesting the potential dominant function of Proteobacteria in maintaining the structural stability of anammox system networks.
Furthermore, 89 nodes were found to be shared among the three microbial networks (Figure 4b), with the majority belonging to Proteobacteria (23.8%), Bacteroidota (20.4%), and Chloroflexi (15.9%). And 65, 165, and 15 distinct nodes were observed in the IFAS-PN/A, IFAS-SNAD, and UASB networks, respectively. The networks of the two IFAS systems exhibited higher node diversity than the UASB system, consistent with the conclusions from the aforementioned microbial community analysis. These findings indicated the two IFAS microbial networks exhibited greater complexity and species diversity than the UASB network. The three networks comprised 10,504 edges. Of them, 93.4% of the edges (9809 edges) appeared only once across the three networks, which could be random or temporary correlations between species, displaying a significant difference in interactions among networks. Only 11 edges were present across three systems, and 9 of 11 edges reflected positive correlations. Moreover, the four related taxa (ASVs 130, 313, 353, and 358) involved in these nine positive connections were all derived from the same module, suggesting particularly strong interrelationships among them.
In addition, three anammox systems were all dominated by positive relationships, accounting for 95.04% (IFAS-PN/A), 79.96% (IFAS-SNAD), and 97.92% (UASB) of the total edges. It indicated that cooperative relationships were predominant in the three networks. And the resulting three networks generated 7, 8, and 6 modules with the modularity of 0.454, 0.391, and 0.495, respectively, illustrating that microorganisms tended to cluster together to maintain the stability of the network structure. It was also found that the M2 (IFAS-PN/A), M1 (IFAS-SNAD), and M1 (UASB) were the largest modules in these three networks, containing 102, 149, and 82 nodes, respectively (Figure S2). Among them, Candidatus_Brocadia, SM1A02, and Candidatus_Kuenenia, regarded as typical or potential AnAOB, were observed in the IFAS-PN/A, IFAS-SNAD, and UASB networks, respectively, suggesting that these modules performed the primary function of the anammox process and AnAOB required close interactions with other species during this process. Furthermore, the proportion of positive correlations between AnAOB and other taxa was lower in the IFAS-SNAD system (55.93%) compared to the IFAS-PN/A system (91.76%) and the UASB system (72.22%). The IFAS-SNAD network exhibited higher taxonomic competition, consistent with the fact that the organic environment favored the rapid proliferation of heterotrophic bacteria, which might compete with AnAOB for substrates and thereby inhibit their growth [54].

3.4. The Prediction of the Impact of Environmental Factors on NRR Using ML

In this study, datasets from three systems (IFAS-PN/A, IFAS-SNAD, and UASB) were used for model training and testing, and model performance was evaluated using statistical metrics. The stacked models combined with SHAP analysis were used to predict the nitrogen removal rate (NRR) and to interpret the influence of environmental factors on NRR.

3.4.1. Evaluation of Model Performance

The evaluation metrics for the training and testing datasets of five base models and the stacked models ST-RF and ST-XGB were showed in Table 1. For the IFAS-PN/A testing dataset, the R2, MAE, and RMSE values of individual ML models ranged from 0.787 to 0.988, 0.060 to 0.217, and 0.090 to 0.378, respectively, while the average values of the corresponding metrics for stacked models were 0.990, 0.041, and 0.091, respectively. It was found that the performance of the stacked models was superior to that of the individual ML models, with the best model, ST-RF, showing R2, MAE, and RMSE values of 0.990, 0.042, and 0.081, respectively. Similar results were observed in the testing datasets in IFAS-SNAD and UASB systems. Furthermore, the model evaluation metrics for the training datasets’ performance comparison across the three systems aligned with the aforementioned testing dataset results. These findings collectively demonstrated that the predicted NRR values from the ST-RF model consistently agreed with the actual NRR values (Figure 5), achieving significantly higher prediction accuracy than other models. The stacked models ST-RF effectively enhanced prediction precision and reduced prediction errors.

3.4.2. Identification of Key Environmental Factors

Given the higher predictive accuracy of the ST-RF model, the SHAP method was applied to interpret the ST-RF model and obtain the importance ranking of each environmental factor in the three systems, with the results shown in Figure 6.
In IFAS-PN/A system, the five features with the greatest impact on NRR were NLR, PH, chemical oxygen demand (COD), NH4+-Ninf, and DO. Specifically, pH and DO regulated the selection of AOB and NOB and thus determined the availability of nitrite [55], whereas NH4+-Ninf and NLR were directly related to substrate supply. At the same time, COD needed to be maintained within an optimal range that permitted limited heterotrophic activity to support denitrification while avoiding suppression of autotrophic anammox pathways [56]. In IFAS-SNAD system, the five features with the greatest impact on NRR were NH4+-Ninf, hydraulic retention time (HRT), NLR, temperature, and COD. Among these, appropriate HRT and temperature provided the necessary conditions for the concurrent expression of these pathways. Meanwhile, NH4+-Ninf and NLR determined the nitrite supply required to sustain both nitrification and anammox activity. Compared with the two IFAS systems, NRR in UASB system was influenced not only by NLR and HRT but also by influent concentrations (NH4+-Ninf and NO2-Ninf). A previous study reported that the performance and stability of UASB-anammox systems were mainly controlled by substrate concentration and hydraulic conditions, with excessive nitrogen loading rates or nitrite levels beyond tolerance thresholds causing substrate inhibition and reduced nitrogen removal efficiency [57]. These results suggested that NRR of different systems depended on the control of key environmental factors, and the importance of influencing factors depended on the system type.

3.5. Impacts of Environmental Factors on Bacterial Communities

Based on the analysis of microbial community differences and the environmental factor importance ranking derived using SHAP in the IFAS-PN/A, IFAS-SNAD, and UASB systems, specific environmental factors were found to be associated with bacterial community composition changes in the three systems.
In the IFAS-PN/A system, SHAP analysis identified NLR, pH, COD, NH4+-Ninf, and DO as the most influential environmental factors affecting NRR. Biologically, slightly low DO and optimal pH favor partial nitritation by suppressing NOB, thereby ensuring a steady nitrite supply to AnAOB. NH4+-Ninf and NLR directly determined substrate supply, and moderate NLR could promote the selective enrichment of AnAOB, whereas excessive NLR could cause the inhibition of their growth [58]. In addition, COD concentration also served as a pivotal regulatory factor. A previous study has found that heterotrophic denitrifiers could compete with AnAOB for nitrite under high COD conditions, thereby inhibiting the activity of AnAOB and reducing nitrogen removal efficiency. Even when AnAOB remain dominant, this competition might drive their metabolism toward organic substrate utilization rather than ammonium conversion, primarily impairing nitrogen metabolism [59]. The high relative abundance of Candidatus_Brocadia (5.0%) in this system was a direct consequence of regulation by these environmental factors. Previous studies demonstrated that Candidatus_Brocadia had the ability to thrive in partial nitrification environments [60,61], and network analysis results also revealed Candidatus_Brocadia occupied a central position in the largest network module and primarily exhibited cooperative relationships with other taxa, highlighting the important role in IFAS-PN/A system.
In the IFAS-SNAD system, the SHAP revealed that NRR was dependent on NH4+-Ninf, HRT and NLR, while other features, such as COD and temperature, also exerted a certain degree of influence on the NRR. This finding reflected the multifaceted nature of the SNAD process, which required a balance between aerobic, anoxic, and anaerobic conditions to support simultaneous nitrification, denitrification, and anammox. A previous study had demonstrated that intermittent aeration and organic load control were crucial for maintaining the balance of nitrification, anammox, and denitrification processes within SNAD systems, while also highlighting the importance of biofilm carriers for retaining slow-growing AnAOB [62]. This balance was further reflected in the distinct microbial community structure. The relatively long HRT supported the retention and enrichment of AnAOB while provided sufficient contact time for the growth of heterotrophic denitrifiers. Especially, the enrichment of denitrifying bacteria PHOS-HE36 and nitrifying Nitrospira (NOB) ensured a sufficient supply of nitrite for the anammox process. Network analysis further revealed that AnAOB appeared in the largest module, and exhibited competitive relationships with most taxa. This might be attributed to competition for substrates between heterotrophic bacteria and AnAOB, thereby inhibiting the growth of AnAOB. Thus, these environmental factors could directly influence the microbial community, which in turn mediated the nitrogen removal performance of the SNAD process through complex microbial interactions.
In the UASB system, the model indicated a strong dependence of NRR on NLR, with other environmental factors showing less impact. The overwhelming dominance of NLR as the primary environmental driver would impose extreme selective pressure that led to the enrichment of dominant microorganisms. It was consistent with the significantly higher relative abundance of Candidatus_Kuenenia compared to other species in this study. A similar finding has been obtained from an earlier investigation, for example, Candidatus_Kuenenia exhibited a high rate of protein synthesis and metabolic activity, making it particularly well-suited for high NLR and rapid growth rates [63]. According to the research conducted by Kim et al. [64], the granular UASB system as a nitrogen removal system showed excellent performance compared with the biofilm system at high NLR, further substantiating the efficient denitrification capabilities of UASB system. Our preceding analyses of species diversity and network structure also supported this. The UASB system exhibited low microbial diversity with a high abundance of AnAOB, and the corresponding microbial network was characterized by low modularity and fewer nodes, indicating a highly specialized anammox process within the system.
In summary, different environmental parameters significantly influenced the microbial community structure, which directly determined the functional characteristics of each reactor system. IFAS-SNAD system was shaped by coordinated regulation of multiple environmental factors. This system exhibited higher microbial richness and more complex, connected co-occurrence network, thereby enhancing functional stability and promoting synergistic nitrogen removal through the SNAD pathway. In contrast, the IFAS-PN/A system exhibited intermediate diversity and network complexity, which was consistent with its functional focus on the synergistic interaction between AOB and Candidatus_Brocadia, maintained by the factors such as DO and pH. Meanwhile, the low-diversity but highly modular network in the UASB system reflected a specialized community dominated by Candidatus_Kuenenia, which could enhance the NRR under high NLR but might be more vulnerable to environmental factors.

4. Conclusions

This study systematically compared the microbial community structure, microbial networks, and nitrogen removal performance across three anammox systems (UASB, IFAS-PN/A, and IFAS-SNAD). The results revealed different microbial diversity and composition among the systems, with IFAS-SNAD exhibiting the highest diversity and functional complexity, while IFAS-PN/A and UASB displayed lower diversity and were dominated by Candidatus_Brocadia and Candidatus_Kuenenia, respectively. Network analysis further indicated that IFAS-SNAD possessed a more complex microbial interaction network, whereas UASB showed a simpler network. Furthermore, ML analysis showed that stacked models consistently outperformed single models in predicting NRR accuracy, with the ST-RF model demonstrating the highest predictive precision (R2 > 0.987). And SHAP analysis demonstrated that the impact of different environmental factors on NRR depended on the type of system. Notably, the operational differences across the three anammox systems reflect practical application diversity, and these differences were fully accounted for by identifing system-specific environmental factors using ML and SHAP analysis.
Overall, the bioinformatics and ML methods are conducted in this study to examine variations in microbial communities among the three systems and to assess how environmental parameters influence NRR. The integration of high-throughput sequencing with ML presents a powerful and data-driven pathway for enhancing water pollution control, providing theoretical support for constructing more efficient ecological engineering systems for wastewater treatment. However, the model in this study has only been internally validated, and external validation is recommended to further confirm its generalizability and practical applicability.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/microorganisms13122795/s1, Figure S1. Literature Search Diagram.; Figure S2. Microbial network analysis of ASVs across different systems organized by different modules. Each node represents an ASV, with node size proportional to its degree and color denoting different modules.; Table S1. Information of three types of systems analyzed in this study; Supplementary Material S2. Environmental factor data collected for three systems in this study. Refs. [65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80] can be found in Supplementary Materials.

Author Contributions

Visualization, X.Z.; software, X.Z.; writing—original draft, X.Z.; writing—review and editing, X.Z., T.Y. and L.H.; investigation, W.L.; methodology, W.L.; resources, L.H.; validation, L.H.; funding acquisition, L.H.; supervision, L.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National key research and development plan (2022YFC3203401).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AnammoxAnaerobic ammonia oxidation
NH4+-NAmmonium nitrogen
NO2-NNitrite nitrogen
N2Nitrogen gas
IFASIntegrated fixed-film activated sludge
PN/APartial nitritation/anammox
SNADSimultaneous nitrification, anammox, and denitrification
UASBUpflow anaerobic sludge blanket
IFAS-PN/AIntegrated fixed-film activated sludge-partial nitritation/anammox
IFAS-SNADIntegrated fixed-film activated sludge-simultaneous nitrification, anammox, and denitrification
MLMachine learning
SHAPSHapley Additive exPlanations
PCoAPrincipal Coordinates Analysis
NCBINational Center for Biotechnology Information
ASVAmplicon sequence variant
AnAOBAnammox bacteria
AOBAmmonium-oxidizing bacteria
NOBNitrite-oxidizing bacteria
DODissolved oxygen
TNTotal nitrogen
HRTHydraulic retention time
CODChemical oxygen demand
ANNArtificial Neural Network
SVRSupport Vector Regression
LGBMLight Gradient Boosting Machine
RFRandom Forest
XGBoostExtreme Gradient Boosting
CatBoostCategorical Boosting
AdaBoostAdaptive Boosting
GBMGradient Boosting Machine
ST-XGStacked-XGBoost
ST-RFStacked-RF
NRRNitrogen Removal Rate
NLRNitrogen Loading Rate
NRENitrogen removal efficiency
R2Determination coefficients
RMSERoot mean square error
MAEMean absolute error
avgKAverage degree
avgCCAverage clustering coefficient

References

  1. Adams, M.; Issaka, E.; Chen, C. Anammox-Based Technologies: A Review of Recent Advances, Mechanism, and Bottlenecks. J. Environ. Sci. 2025, 148, 151–173. [Google Scholar] [CrossRef]
  2. Jetten, M.S.M.; Wagner, M.; Fuerst, J.; Van Loosdrecht, M.; Kuenen, G.; Strous, M. Microbiology and Application of the Anaerobic Ammonium Oxidation (‘Anammox’) Process. Curr. Opin. Biotechnol. 2001, 12, 283–288. [Google Scholar] [CrossRef] [PubMed]
  3. Lackner, S.; Gilbert, E.M.; Vlaeminck, S.E.; Joss, A.; Horn, H.; Van Loosdrecht, M.C.M. Full-Scale Partial Nitritation/Anammox Experiences–an Application Survey. Water Res. 2014, 55, 292–303. [Google Scholar] [CrossRef] [PubMed]
  4. Terada, A.; Zhou, S.; Hosomi, M. Presence and Detection of Anaerobic Ammonium-Oxidizing (Anammox) Bacteria and Appraisal of Anammox Process for High-Strength Nitrogenous Wastewater Treatment: A Review. Clean Technol. Environ. Policy 2011, 13, 759–781. [Google Scholar] [CrossRef]
  5. Kuenen, J.G. Anammox and Beyond. Environ. Microbiol. 2020, 22, 525–536. [Google Scholar] [CrossRef]
  6. Ibrahim, M.; Yusof, N.; Mohd Yusoff, M.Z.; Hassan, M.A. Enrichment of Anaerobic Ammonium Oxidation (Anammox) Bacteria for Short Start-up of the Anammox Process: A Review. Desalination Water Treat. 2016, 57, 13958–13978. [Google Scholar] [CrossRef]
  7. Strous, M.; Heijnen, J.J.; Kuenen, J.G.; Jetten, M.S.M. The Sequencing Batch Reactor as a Powerful Tool for the Study of Slowly Growing Anaerobic Ammonium-Oxidizing Microorganisms. Appl. Microbiol. Biotechnol. 1998, 50, 589–596. [Google Scholar] [CrossRef]
  8. Jetten, M.S.M.; Strous, M.; Van De Pas-Schoonen, K.T.; Schalk, J.; Van Dongen, U.G.J.M.; Van De Graaf, A.A.; Logemann, S.; Muyzer, G.; Van Loosdrecht, M.C.M.; Kuenen, J.G. The Anaerobic Oxidation of Ammonium. FEMS Microbiol. Rev. 1998, 22, 421–437. [Google Scholar] [CrossRef]
  9. Abdul Rahiman, S.; Qiblawey, H. Anammox-MBR Technology: Breakthroughs and Challenges in Sustainable Nitrogen Removal from Wastewater. Membranes 2025, 15, 337. [Google Scholar] [CrossRef]
  10. Laureni, M.; Weissbrodt, D.G.; Villez, K.; Robin, O.; De Jonge, N.; Rosenthal, A.; Wells, G.; Nielsen, J.L.; Morgenroth, E.; Joss, A. Biomass Segregation between Biofilm and Flocs Improves the Control of Nitrite-Oxidizing Bacteria in Mainstream Partial Nitritation and Anammox Processes. Water Res. 2019, 154, 104–116, Erratum in Water Res. 2020, 169, 115133. [Google Scholar] [CrossRef]
  11. Vlaeminck, S.E.; Terada, A.; Smets, B.F.; De Clippeleir, H.; Schaubroeck, T.; Bolca, S.; Demeestere, L.; Mast, J.; Boon, N.; Carballa, M.; et al. Aggregate Size and Architecture Determine Microbial Activity Balance for One-Stage Partial Nitritation and Anammox. Appl. Environ. Microbiol. 2010, 76, 900–909. [Google Scholar] [CrossRef]
  12. Yang, S.; Peng, Y.; Zhang, L.; Zhang, Q.; Li, J.; Wang, X. Autotrophic Nitrogen Removal in an Integrated Fixed-Biofilm Activated Sludge (IFAS) Reactor: Anammox Bacteria Enriched in the Flocs Have Been Overlooked. Bioresour. Technol. 2019, 288, 121512. [Google Scholar] [CrossRef] [PubMed]
  13. Yang, J.; Zhang, L.; Sun, H.; Sun, Z.; Li, J. Synergistic Membrane-Biofilm-Sludge System Coupling Partial Nitritation and Anammox: Achieving Efficient Nitrogen Removal in High-Ammonia/Low-Carbon Condensate Wastewater. Bioresour. Technol. 2025, 434, 132819. [Google Scholar] [CrossRef]
  14. Wang, C.; Liu, S.; Xu, X.; Zhang, C.; Wang, D.; Yang, F. Achieving Mainstream Nitrogen Removal through Simultaneous Partial Nitrification, Anammox and Denitrification Process in an Integrated Fixed Film Activated Sludge Reactor. Chemosphere 2018, 203, 457–466. [Google Scholar] [CrossRef] [PubMed]
  15. Du, Y.; Yu, D.; Wang, X.; Zhen, J.; Bi, C.; Gong, X.; Zhao, J. Achieving Simultaneous Nitritation, Anammox and Denitrification (SNAD) in an Integrated Fixed-Biofilm Activated Sludge (IFAS) Reactor: Quickly Culturing Self-Generated Anammox Bacteria. Sci. Total Environ. 2021, 768, 144446. [Google Scholar] [CrossRef] [PubMed]
  16. Ma, B.; Peng, Y.; Zhang, S.; Wang, J.; Gan, Y.; Chang, J.; Wang, S.; Wang, S.; Zhu, G. Performance of Anammox UASB Reactor Treating Low Strength Wastewater under Moderate and Low Temperatures. Bioresour. Technol. 2013, 129, 606–611. [Google Scholar] [CrossRef]
  17. Tang, C.J.; Zheng, P.; Wang, C.H.; Mahmood, Q.; Zhang, J.Q.; Chen, X.G.; Zhang, L.; Chen, J.W. Performance of High-Loaded ANAMMOX UASB Reactors Containing Granular Sludge. Water Res. 2011, 45, 135–144. [Google Scholar] [CrossRef]
  18. Adams, M.; Xie, J.; Kabore, A.W.J.; Chang, Y.; Xie, J.; Guo, M.; Chen, C. Research Advances in Anammox Granular Sludge: A Review. Crit. Rev. Environ. Sci. Technol. 2022, 52, 631–674. [Google Scholar] [CrossRef]
  19. Abma, W.R.; Schultz, C.E.; Mulder, J.W.; Van Der Star, W.R.L.; Strous, M.; Tokutomi, T.; Van Loosdrecht, M.C.M. Full-Scale Granular Sludge Anammox Process. Water Sci. Technol. 2007, 55, 27–33. [Google Scholar] [CrossRef] [PubMed]
  20. Li, X.; Yao, J.; Jia, Y.; Liu, J.; Chen, Y. Initiation of Anammox in an Up-Flow Anaerobic Sludge Bed Reactor: Bacterial Community Structure, Nitrogen Removal Functional Genes, and Antibiotic Resistance Genes. Water 2024, 16, 3426. [Google Scholar] [CrossRef]
  21. Alvarado, V.; Ying, L.; Asghari, V.; Hsu, S.-C.; Lee, P.-H. Modeling of a Mainstream Partial Nitrification/Anammox Process through a Hybrid Theoretical-Machine Learning Approach. ACS EST Water 2025, 5, 1469–1480. [Google Scholar] [CrossRef]
  22. Xu, X.; Du, T.; Guo, D.; Jiang, X.; Zeng, M.; Wu, N.; Wang, C.; Zhang, Z. Deciphering and Predicting Anammox-Based Nitrogen Removal Process under Oxytetracycline Stress via Kinetic Modeling and Machine Learning Based on Big Data Analysis. Sci. Total Environ. 2021, 796, 148980. [Google Scholar] [CrossRef]
  23. Yang, J.; Chen, Z.; Wang, X.; Zhang, Y.; Li, J.; Zhou, S. Elucidating Nitrogen Removal Performance and Response Mechanisms of Anammox under Heavy Metal Stress Using Big Data Analysis and Machine Learning. Bioresour. Technol. 2023, 382, 129143. [Google Scholar] [CrossRef]
  24. Zhang, Y.; Wu, H.; Xu, R.; Wang, Y.; Chen, L.; Wei, C. Machine Learning Modeling for the Prediction of Phosphorus and Nitrogen Removal Efficiency and Screening of Crucial Microorganisms in Wastewater Treatment Plants. Sci. Total Environ. 2024, 907, 167730. [Google Scholar] [CrossRef]
  25. Mu, X.; Jia, F.; Qiu, S.; Li, Y.; Mei, N.; Zhao, X.; Han, B.; Han, X.; Zhang, J.; Yao, H. Predicting and Interpreting Nitrogen Removal Performance and Functional Microbial Abundance of Single-Stage Partial Nitrification and Anammox System Using Machine Learning Methods. Bioresour. Technol. 2025, 437, 133119. [Google Scholar] [CrossRef]
  26. Wolpert, D.H. Stacked Generalization. Neural Netw. 1992, 5, 241–259. [Google Scholar] [CrossRef]
  27. Achraf, R.; Mounir, M.; Moussa, S.M. A New Framework for Energy-Optimized Biological Treatment in Wastewater Treatment Plants Using Machine Learning Techniques. J. Clean. Prod. 2025, 517, 145854. [Google Scholar] [CrossRef]
  28. Rice, J.S.; Emanuel, R.E. How Are Streamflow Responses to the E l N Ino S Outhern O Scillation Affected by Watershed Characteristics? Water Resour. Res. 2017, 53, 4393–4406. [Google Scholar] [CrossRef]
  29. Bakır, R.; Orak, C. Stacked Machine Learning Approach for Predicting Evolved Hydrogen from Sugar Industry Wastewater. Int. J. Hydrogen Energy 2024, 85, 75–87. [Google Scholar] [CrossRef]
  30. Lüftinger, L.; Májek, P.; Beisken, S.; Rattei, T.; Posch, A.E. Learning from Limited Data: Towards Best Practice Techniques for Antimicrobial Resistance Prediction from Whole Genome Sequencing Data. Front. Cell. Infect. Microbiol. 2021, 11, 610348. [Google Scholar] [CrossRef]
  31. Sayers, E.W.; Beck, J.; Bolton, E.E.; Bourexis, D.; Brister, J.R.; Canese, K.; Comeau, D.C.; Funk, K.; Kim, S.; Klimke, W.; et al. Database Resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2021, 49, D10–D17. [Google Scholar] [CrossRef]
  32. Martin, M. Cutadapt Removes Adapter Sequences from High-Throughput Sequencing Reads. EMBnet. J. 2011, 17, 10–12. [Google Scholar] [CrossRef]
  33. Callahan, B.J.; McMurdie, P.J.; Rosen, M.J.; Han, A.W.; Johnson, A.J.A.; Holmes, S.P. DADA2: High-Resolution Sample Inference from Illumina Amplicon Data. Nat. Methods 2016, 13, 581–583. [Google Scholar] [CrossRef] [PubMed]
  34. Bolyen, E.; Rideout, J.R.; Dillon, M.R.; Bokulich, N.A.; Abnet, C.C.; Al-Ghalith, G.A.; Alexander, H.; Alm, E.J.; Arumugam, M.; Asnicar, F.; et al. Reproducible, Interactive, Scalable and Extensible Microbiome Data Science Using QIIME 2. Nat. Biotechnol. 2019, 37, 852–857, Erratum in Nat. Biotechnol. 2019, 37, 1091. [Google Scholar] [CrossRef] [PubMed]
  35. Quast, C.; Pruesse, E.; Yilmaz, P.; Gerken, J.; Schweer, T.; Yarza, P.; Peplies, J.; Glöckner, F.O. The SILVA Ribosomal RNA Gene Database Project: Improved Data Processing and Web-Based Tools. Nucleic Acids Res. 2012, 41, D590–D596. [Google Scholar] [CrossRef]
  36. Liu, J.; Zhu, S.; Liu, X.; Yao, P.; Ge, T.; Zhang, X.-H. Spatiotemporal Dynamics of the Archaeal Community in Coastal Sediments: Assembly Process and Co-Occurrence Relationship. ISME J. 2020, 14, 1463–1478. [Google Scholar] [CrossRef]
  37. Naimi, A.I.; Balzer, L.B. Stacked Generalization: An Introduction to Super Learning. Eur. J. Epidemiol. 2018, 33, 459–464. [Google Scholar] [CrossRef]
  38. Krstajic, D.; Buturovic, L.J.; Leahy, D.E.; Thomas, S. Cross-Validation Pitfalls When Selecting and Assessing Regression and Classification Models. J. Cheminform. 2014, 6, 10. [Google Scholar] [CrossRef]
  39. Cawley, G.C.; Talbot, N.L.C. On Over-Fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation. J. Mach. Learn. Res. 2010, 11, 2079–2107. [Google Scholar]
  40. Raschka, S. Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning. arXiv 2020, arXiv:1811.12808. [Google Scholar] [CrossRef]
  41. Bergstra, J.; Bengio, Y. Random Search for Hyper-Parameter Optimization. J. Mach. Learn. Res. 2012, 13, 281–305. [Google Scholar] [CrossRef]
  42. Speth, D.R.; In ’T Zandt, M.H.; Guerrero-Cruz, S.; Dutilh, B.E.; Jetten, M.S.M. Genome-Based Microbial Ecology of Anammox Granules in a Full-Scale Wastewater Treatment System. Nat. Commun. 2016, 7, 11172. [Google Scholar] [CrossRef] [PubMed]
  43. Moraga, R.; Galan, A.; Rossello-Mora, R.; Araya, R.; Valdés, J. Composición de la comunidad procariota involucrada en la producción de nitrógeno en sedimentos de la bahía mejillones. Rev. Biol. Mar. Oceanogr. 2014, 49, 225–241. [Google Scholar] [CrossRef]
  44. Wen, X.; Zhou, J.; Li, Y.; Qing, X.; He, Q. A Novel Process Combining Simultaneous Partial Nitrification, Anammox and Denitrification (SNAD) with Denitrifying Phosphorus Removal (DPR) to Treat Sewage. Bioresour. Technol. 2016, 222, 309–316. [Google Scholar] [CrossRef]
  45. Azari, M.; Jurnalis, A.; Denecke, M. The Influence of Aeration Control and Temperature Reduction on Nitrogen Removal and Microbial Community in Two Anammox-based Hybrid Sequencing Batch Biofilm Reactors. J. Chem. Technol. Biotechnol. 2021, 96, 3358–3370. [Google Scholar] [CrossRef]
  46. Bovio-Winkler, P.; Guerrero, L.D.; Erijman, L.; Oyarzúa, P.; Suárez-Ojeda, M.E.; Cabezas, A.; Etchebehere, C. Genome-Centric Metagenomic Insights into the Role of Chloroflexi in Anammox, Activated Sludge and Methanogenic Reactors. BMC Microbiol. 2023, 23, 45. [Google Scholar] [CrossRef]
  47. Mardanov, A.V.; Beletsky, A.V.; Ravin, N.V.; Botchkova, E.A.; Litti, Y.V.; Nozhevnikova, A.N. Genome of a Novel Bacterium “Candidatus Jettenia Ecosi” Reconstructed from the Metagenome of an Anammox Bioreactor. Front. Microbiol. 2019, 10, 2442. [Google Scholar] [CrossRef] [PubMed]
  48. Botchkova, E.A.; Litti, Y.V.; Kuznetsov, B.B.; Nozhevnikova, A.N. Microbial Biofilms Formed in a Laboratory-Scale Anammox Bioreactor with Flexible Brush Carrier. J. Biomater. Nanobiotechnology 2014, 5, 76–82. [Google Scholar] [CrossRef]
  49. Kindaichi, T.; Yuri, S.; Ozaki, N.; Ohashi, A. Ecophysiological Role and Function of Uncultured Chloroflexi in an Anammox Reactor. Water Sci. Technol. 2012, 66, 2556–2561. [Google Scholar] [CrossRef]
  50. Choi, D.; Yun, W.; Choi, Y.; Jung, J.; Park, S.; Ju, D.; Bae, H. Insights into Thiosulfate-Driven Partial Denitrification Synergistically Mediated by Anaerobic Ammonium Oxidation: Biosynthesized Signaling Molecules and Enzymatic Collaboration. Chem. Eng. J. 2025, 506, 160069. [Google Scholar] [CrossRef]
  51. Koenig, A.; Zhang, T.; Liu, L.-H.; Fang, H.H.P. Microbial Community and Biochemistry Process in Autosulfurotrophic Denitrifying Biofilm. Chemosphere 2005, 58, 1041–1047. [Google Scholar] [CrossRef]
  52. Lücker, S.; Wagner, M.; Maixner, F.; Pelletier, E.; Koch, H.; Vacherie, B.; Rattei, T.; Damsté, J.S.S.; Spieck, E.; Le Paslier, D.; et al. A Nitrospira Metagenome Illuminates the Physiology and Evolution of Globally Important Nitrite-Oxidizing Bacteria. Proc. Natl. Acad. Sci. USA 2010, 107, 13479–13484. [Google Scholar] [CrossRef] [PubMed]
  53. Zhou, S.; Zhu, W.; He, Y.; Zhang, T.; Jiang, Z.; Zeng, M.; Wu, N. A Comprehensive Analysis of Microbial Community Differences in Four Morphologies of Mainstream Anaerobic Ammonia Oxidation Systems Using Big-Data Mining and Machine Learning. Front. Mar. Sci. 2024, 11, 1458853. [Google Scholar] [CrossRef]
  54. Molinuevo, B.; Garcia, M.; Karakashev, D.; Angelidaki, I. Anammox for Ammonia Removal from Pig Manure Effluents: Effect of Organic Matter Content on Process Performance. Bioresour. Technol. 2009, 100, 2171–2175. [Google Scholar] [CrossRef]
  55. Laureni, M. Mainstream Partial Nitritation and Anammox: Long-Term Process Stability and Effluent Quality at Low Temperatures. Water Res. 2016, 101, 628–639. [Google Scholar] [CrossRef]
  56. Al-Hazmi, H.; Grubba, D.; Majtacz, J.; Kowal, P.; Makinia, J. Evaluation of Partial Nitritation/Anammox (PN/A) Process Performance and Microorganisms Community Composition under Different C/N Ratio. Water 2019, 11, 2270. [Google Scholar] [CrossRef]
  57. Ma, H.; Niu, Q.; Zhang, Y.; He, S.; Li, Y.-Y. Substrate Inhibition and Concentration Control in an UASB-Anammox Process. Bioresour. Technol. 2017, 238, 263–272. [Google Scholar] [CrossRef]
  58. Zhao, X.; Yu, D.; Zhang, J.; Miao, Y.; Ma, G.; Li, J.; Zhang, Y.; Zhi, J.; Dong, G. Enhancing Nitrite Production Rate Made Anammox Bacteria Have a Competitive Advantage over Nitrite Oxidizing Bacteria in Mainstream Anammox System. Water Environ. Res. 2023, 95, e10878. [Google Scholar] [CrossRef] [PubMed]
  59. Zhang, X.; Liu, Y.; Li, Z.R.; Zhang, J.; Chen, Y.; Wang, Q. Impact of COD/N on Anammox Granular Sludge with Different Biological Carriers. Sci. Total Environ. 2020, 728, 138557. [Google Scholar] [CrossRef]
  60. Kartal, B.; De Almeida, N.M.; Maalcke, W.J.; Op Den Camp, H.J.M.; Jetten, M.S.M.; Keltjens, J.T. How to Make a Living from Anaerobic Ammonium Oxidation. FEMS Microbiol. Rev. 2013, 37, 428–461. [Google Scholar] [CrossRef]
  61. Zhang, F.; Peng, Y.; Liu, Y.; Zhao, L. Improving Stability of Mainstream Anammox in an Innovative Two-Stage Process for Advanced Nitrogen Removal from Mature Landfill Leachate. Bioresour. Technol. 2021, 340, 125617. [Google Scholar] [CrossRef] [PubMed]
  62. Zheng, Z.; Li, J.; Ma, J.; Du, J.; Bian, W.; Li, Y.; Zhang, Y.; Zhao, B. Nitrogen Removal via Simultaneous Partial Nitrification, Anammox and Denitrification (SNAD) Process under High DO Condition. Biodegradation 2016, 27, 195–208. [Google Scholar] [CrossRef]
  63. Kartal, B.; Kuypers, M.M.M.; Lavik, G.; Schalk, J.; Op Den Camp, H.J.M.; Jetten, M.S.M.; Strous, M. Anammox Bacteria Disguised as Denitrifiers: Nitrate Reduction to Dinitrogen Gas via Nitrite and Ammonium. Environ. Microbiol. 2007, 9, 635–642. [Google Scholar] [CrossRef]
  64. Kim, I.; Lee, H.H.; Chung, Y.C.; Jung, J.Y. High-Strength Nitrogenous Wastewater Treatment in Biofilm and Granule Anammox Processes. Water Sci. Technol. 2009, 60, 2365–2371. [Google Scholar] [CrossRef]
  65. Yang, S.; Peng, Y.; Zhang, S.; Han, X.; Li, J.; Zhang, L. Carrier Type Induces Anammox Biofilm Structure and the Nitrogen Removal Pathway: Demonstration in a Full-Scale Partial Nitritation/Anammox Process. Bioresour. Technol. 2021, 334, 125249. [Google Scholar] [CrossRef]
  66. Han, X.; Zhang, S.; Yang, S.; Zhang, L.; Peng, Y. Full-Scale Partial Nitritation/Anammox (PN/A) Process for Treating Sludge Dewatering Liquor from Anaerobic Digestion after Thermal Hydrolysis. Bioresour. Technol. 2020, 297, 122380. [Google Scholar] [CrossRef] [PubMed]
  67. Yang, M.; Li, J.; Li, Z.; Peng, Y.; Zhang, L. Enhancing Anammox Bacteria Enrichment in Integrated Fixed-Film Activated Sludge Partial Nitritation/Anammox Process via Floc Retention Control. Bioresour. Technol. 2024, 391, 129938. [Google Scholar] [CrossRef]
  68. Yang, Y.; Zhang, L.; Cheng, J.; Zhang, S.; Li, B.; Peng, Y. Achieve Efficient Nitrogen Removal from Real Sewage in a Plug-Flow Integrated Fixed-Film Activated Sludge (IFAS) Reactor via Partial Nitritation/Anammox Pathway. Bioresour. Technol. 2017, 239, 294–301. [Google Scholar] [CrossRef]
  69. Ren, S.; Wang, Z.; Jiang, H.; Li, X.; Zhang, Q.; Peng, Y. Efficient Nitrogen Removal from Mature Landfill Leachate in a Step Feed Continuous Plug-Flow System Based on One-Stage Anammox Process. Bioresour. Technol. 2022, 347, 126676. [Google Scholar] [CrossRef]
  70. Liu, W.; Shen, C.; Liu, C.; Zhang, S.; Hao, S.; Peng, Y.; Li, J. Achieving Stable Mainstream Nitrogen and Phosphorus Removal Assisted by Hydroxylamine Addition in a Continuous Partial Nitritation/Anammox Process from Real Sewage. Sci. Total Environ. 2021, 794, 148478. [Google Scholar] [CrossRef] [PubMed]
  71. Roots, P.; Rosenthal, A.F.; Yuan, Q.; Wang, Y.; Yang, F.; Kozak, J.A.; Zhang, H.; Wells, G.F. Optimization of the Carbon to Nitrogen Ratio for Mainstream Deammonification and the Resulting Shift in Nitrification from Biofilm to Suspension. Environ. Sci. Water Res. Technol. 2020, 6, 3415–3427. [Google Scholar] [CrossRef]
  72. Wang, Y.; Hu, X.; Jiang, B.; Song, Z.; Ma, Y. Symbiotic Relationship Analysis of Predominant Bacteria in a Lab-Scale Anammox UASB Bioreactor. Environ. Sci. Pollut. Res. 2016, 23, 7615–7626. [Google Scholar] [CrossRef]
  73. Peng, Z.; Lei, Y.; Liu, Y.; Wan, X.; Yang, B.; Pan, X. Fast Start-up and Reactivation of Anammox Process Using Polyurethane Sponge. Biochem. Eng. J. 2022, 177, 108249. [Google Scholar] [CrossRef]
  74. Ya, T.; Huang, Y.; Wang, K.; Wang, J.; Liu, J.; Hai, R.; Zhang, T.; Wang, X. Functional Stability Correlates with Dynamic Microbial Networks in Anammox Process. Bioresour. Technol. 2023, 370, 128557. [Google Scholar] [CrossRef]
  75. Peng, Z.; Zhang, L.; Zhang, Q.; Li, X.; Peng, Y. Establishing a Two-Stage System to Efficiently Treat Real Domestic Sewage by Partial Nitrification-SBR and Air-Lift Anammox-UASB: Reactivating and Enhancing Anammox Bacteria to Optimize the Nitrogen Removal Performance. Chem. Eng. J. 2025, 506, 160333. [Google Scholar] [CrossRef]
  76. Ma, Y.; Wang, B.; Li, X.; Wang, S.; Wang, W.; Peng, Y. Enrichment of Anammox Biomass during Mainstream Wastewater Treatment Driven by Achievement of Partial Denitrification through the Addition of Bio-Carriers. J. Environ. Sci. 2024, 137, 181–194. [Google Scholar] [CrossRef]
  77. Cao, S.; Du, R.; Li, B.; Ren, N.; Peng, Y. High-Throughput Profiling of Microbial Community Structures in an ANAMMOX-UASB Reactor Treating High-Strength Wastewater. Appl. Microbiol. Biotechnol. 2016, 100, 6457–6467. [Google Scholar] [CrossRef]
  78. Song, Y.-X.; Liao, Q.; Yu, C.; Xiao, R.; Tang, C.-J.; Chai, L.-Y.; Duan, C.-S. Physicochemical and Microbial Properties of Settled and Floating Anammox Granules in Upflow Reactor. Biochem. Eng. J. 2017, 123, 75–85. [Google Scholar] [CrossRef]
  79. Gao, M.; Dang, H.; Zou, X.; Yu, N.; Guo, H.; Yao, Y.; Liu, Y. Deciphering the Role of Granular Activated Carbon (GAC) in Anammox: Effects on Microbial Succession and Communication. Water Res. 2023, 233, 119753. [Google Scholar] [CrossRef] [PubMed]
  80. Jia, F.; Peng, Y.; Li, J.; Li, X.; Yao, H. Metagenomic Prediction Analysis of Microbial Aggregation in Anammox-dominated Community. Water Environ. Res. 2021, 93, 2549–2558. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Overall structure of stacked model. Red squares denote the validation set for the current iterative round.
Figure 1. Overall structure of stacked model. Red squares denote the validation set for the current iterative round.
Microorganisms 13 02795 g001
Figure 2. A comprehensive analysis of microbial communities in three different anammox systems. (a) Comparison of the Simpson Diversity and Sobs index among the three systems (**** p < 0.0001); (b) PCoA of microbial community within three systems based on phylum level Bray–Curtis distance; (c) PCoA of microbial community within three systems based on genus level Bray–Curtis distance.
Figure 2. A comprehensive analysis of microbial communities in three different anammox systems. (a) Comparison of the Simpson Diversity and Sobs index among the three systems (**** p < 0.0001); (b) PCoA of microbial community within three systems based on phylum level Bray–Curtis distance; (c) PCoA of microbial community within three systems based on genus level Bray–Curtis distance.
Microorganisms 13 02795 g002
Figure 3. Comparison of bacteria genera across three distinct systems. (a) Venn diagram illustrating the shared and unique genus. Red dots represent the number of shared genera among systems (vertical positions correspond to values in the bar chart above). The number of dots connected by lines indicates the number of systems sharing the genera. (b) The top ten phylum with the highest relative abundance in three different systems; (c) The top ten genus with the highest relative abundance in three different systems.
Figure 3. Comparison of bacteria genera across three distinct systems. (a) Venn diagram illustrating the shared and unique genus. Red dots represent the number of shared genera among systems (vertical positions correspond to values in the bar chart above). The number of dots connected by lines indicates the number of systems sharing the genera. (b) The top ten phylum with the highest relative abundance in three different systems; (c) The top ten genus with the highest relative abundance in three different systems.
Microorganisms 13 02795 g003
Figure 4. Microbial network analysis of ASVs across different systems. (a) The differences of network topological parameters across different systems; (b) Venn diagram of nodes in three networks; (c) Network analysis of ASVs in different systems. Each node represents an ASV, with node size proportional to its degree and color denoting phylum.
Figure 4. Microbial network analysis of ASVs across different systems. (a) The differences of network topological parameters across different systems; (b) Venn diagram of nodes in three networks; (c) Network analysis of ASVs in different systems. Each node represents an ASV, with node size proportional to its degree and color denoting phylum.
Microorganisms 13 02795 g004
Figure 5. Predictive performance of ST-RF model for NRR in the three systems. In this bar chart, blue denotes training set data and orange denotes testing set data. These colors represent predicted result distributions for the model’s training and testing phases.
Figure 5. Predictive performance of ST-RF model for NRR in the three systems. In this bar chart, blue denotes training set data and orange denotes testing set data. These colors represent predicted result distributions for the model’s training and testing phases.
Microorganisms 13 02795 g005
Figure 6. Final investigation on the importance of environmental factor features. The x-axis represented the importance of each feature, which meant the degree of contribution of each feature to the model prediction results.
Figure 6. Final investigation on the importance of environmental factor features. The x-axis represented the importance of each feature, which meant the degree of contribution of each feature to the model prediction results.
Microorganisms 13 02795 g006
Table 1. Comparison of evaluation metrics across models.
Table 1. Comparison of evaluation metrics across models.
Training DatasetTesting Dataset
ModelsR2MAERMSER2MAERMSE
IFAS-PN/AAdaBoost0.9860.0460.0650.9880.0600.090
CatBoost0.8760.1180.1940.8430.1860.325
LGBM0.8950.0880.1790.8480.1600.319
GBM0.7940.1500.2500.7870.2170.378
SVR0.9210.0980.1550.9010.1550.258
ST-RF0.9910.0280.0530.9900.0420.081
ST-XGB0.9910.0270.0500.9890.0400.100
IFAS-SNADAdaBoost0.9940.0280.0580.9930.0260.056
CatBoost0.9030.1730.2300.9030.1590.206
LGBM0.9370.1530.1860.9350.1400.169
GBM0.7960.2850.3340.7880.2620.305
SVR0.9740.0930.1190.9690.0920.116
ST-RF0.9960.0210.0480.9950.0220.045
ST-XGB0.9930.0260.0610.9940.0240.053
UASBAdaBoost0.9800.2980.5090.9770.2800.516
CatBoost0.8830.8161.2200.8990.7531.069
LGBM0.9290.6090.9500.9360.5710.851
GBM0.7811.1401.6690.7951.0611.527
SVR0.8110.6691.5510.8400.5901.347
ST-RF0.9840.2220.4550.9870.1980.384
ST-XGB0.9790.2390.5200.9810.2220.468
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, X.; Ya, T.; Han, L.; Li, W. A Comprehensive Analysis of Microbial Community and Nitrogen Removal Rate Predictions in Three Anammox Systems. Microorganisms 2025, 13, 2795. https://doi.org/10.3390/microorganisms13122795

AMA Style

Zhang X, Ya T, Han L, Li W. A Comprehensive Analysis of Microbial Community and Nitrogen Removal Rate Predictions in Three Anammox Systems. Microorganisms. 2025; 13(12):2795. https://doi.org/10.3390/microorganisms13122795

Chicago/Turabian Style

Zhang, Xuan, Tao Ya, Lu Han, and Weize Li. 2025. "A Comprehensive Analysis of Microbial Community and Nitrogen Removal Rate Predictions in Three Anammox Systems" Microorganisms 13, no. 12: 2795. https://doi.org/10.3390/microorganisms13122795

APA Style

Zhang, X., Ya, T., Han, L., & Li, W. (2025). A Comprehensive Analysis of Microbial Community and Nitrogen Removal Rate Predictions in Three Anammox Systems. Microorganisms, 13(12), 2795. https://doi.org/10.3390/microorganisms13122795

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop